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FUSION PROTEINS COMPRISING A DIMERIZATION, J^^^^^^^ol^ st^O 
ADDITIONAL HETEROLOGOUS TRANSCRIPTION ACTIVATION. TRANSCRIPTION REPRESSION, u 

OR LIGAND BINDING DOMAIN 

Background of the Invention 

Activation of transcription of a euKaryot.c gene involves tne interaction o a vane., 
of proteins to form a complex that is reoru.ted to the gene through prote,n:DNA ,nteract,ons. 
Key protein doma.ns on one or more of the components include transection a«n 
.ornLs and DNA binding domains. Elucidating the mechan.sm of transcription. Ktottfy.no 
anc characterizing components of the transcriptional machinery and ,n some cases 
narness,ng some of those components have been the subject of extens.ve ^search. 
See. e.g.. Brent and Ptashne. 19 85; Hope and Struhl, 1986; Keegan et a . ^ Relds 
and Song, 19 89; Spencer et a!. 1993. Belshaw et a.. 1996 and Rivera et al. 1996)(A 
Bibliography is provided just pnor to the Examples, below.) 

Transcription act.vat.on oomains are tnought to function by recasting a numoer of 
M prote,ns with specific functions to tne promoter (Lin and Green, 1991 ; Goodrich et al, 1993; 
Orphanides et al. 1996 and references cited therein; Ptashne and Gann. 1997 and 
references cited therein). Among the large number of activation domains that have Ipeen 
characterized to date, the aadic-activation Domain of the Herpes Simplex v.rus encoded 
protein VP16. is ccns.oered to be a very strong .nducer of transcription and ,s widely used 
25 n bio ogica, research (Sadowski et al, 1988. Ptashne and Gann. 1997). 

activation doma.n o, the p65 subun.t cf the human transcnption factor NF^sa = 
potent stimulator of gene express-on, and in certain contexts can induce 
strongly than VP16 (Schm.tz and Baeuene, 1991; Ballard et aJ. 1992; Moore at al. 1993. 
Blair et al 1994; Natesan et al. 1997). Both the VP16 and p65 activation domains are 
30 thought to function by interacting with and recanting a number of prote.ns to the promoter 
(Cress and Tnezenberg. 1990; Scm.tz at al. 1994; Uesugi et al. 1997). 

One of the remarkable features of such activation domains is that lus.ng them to 
heterologous prote.n corner seldom affects their ability to activate transcnption when 
recnuiteO to a wide vanety of promoters. The h.gh degree of functional 
35 exhibited by these activation doma.ns makes them valuable tools ,n various ^ca, 

assays for Ina.yz.ng gene expression and prote.n-protem « «^^-» 
moiecule orug interactions (Fields and Song. 1989; Senguptha et al, 1996. R vera e, al. 
19 96; Tnezenberg. 1995 ano references cited therein). The ability to activate gene 
expression strongly and when recruited to a wide range of promoters makes bo t. p65 ano 
40 VP16 attractive candidates for activation of gene transcnot.on in gene therapy and , other 

applications. However, even more potent act,vat,on contains. ,f available, would be use,, 
lor achieving higher levels of transcnption on a per cel. basis, and for improving the 
eL ency of the many b,o,og,ca. assays that re.y upon act,vat,on of transcnp.on of a 
reporter gene. 
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Several strategies to improve the potency ot activation aomains and thereby the 
express.on of genes under their control have been reported (Emami and Carey. 1992, 
Gemeratal 1994; Ohashi et al, 1994; Blair at a!. 1996; Tanaka et al. 1996). These 
approaches generally involve mcreas.ng the number o< cop.es of act.vat.on aomains fused 
to the DNA binding domain or generating activators conta.n.ng synerg.zing combinations of 
activation domains. Although some activators generated by these methods have been 
shown to be more potent, a number of limitations preclude their w.aespread application 
First potent activators comprising reiterated activation domains do not increase the 
absolute levels of recorter gene expression when tested on promoters with multiple 
binding s.tes for the activator (Emam, and Carey. 1992). Second, a number of synergic 
combinations of activation domains reported in the literature .nvolve weaK activation 
aomams and the absolute levels of gene express.on induced by these synerg.zmg 
activation domains are much .ower compared to potent acd.c act.vat.on domains from VP 16 
or p65 (Gerber at al. 1 994; Tanaka et al. 1 996). Third, it is not known wnethe- any of these 
potent activat.cn domains are capable of inducing gene transition strongly when they 
are non-covalently linked to the DNA binding domain. Fourth, many potent activators 
containing multiple copies of VP16 or other acidic activators are highly toxic and/or 
accumulate to only low levels in the cell. 

As mentioned at the outset, a variety of important applications involv.ng gene 
transcription require or would benefit from higher levels of gene expression. As noted 
above however, efforts to improve the potency of activation comains have been 
disappointing. Moreover, expression of various transcnpt.on activators revealed that 
observed levels of more potent activators, such as the p65 un.t of NF-kB. are lower thar. 
expected. Without wishing to be bound by any one theory, we suggest that the more 
potent the activation domain, the more toxic it is to the cell, the more dislavored ,s its 
expression and'or the less of it is observed to accumulate in cells. How. then, is ,t 
possible to increase levels of heterologous gene expression? RematKably. we have 
found that it is still possible to outmaneuver these facts of nature to improve heterologous 
gene expression and have in tact done so using the pnnciples of -bundling", the 
engineering of the transcription activation domain, and combinat.ons thereof, as described 

below. 



Summary ot the Invention 

This document discloses new improvements in the design ana delivery ct 
35 transcription activation domains and provides .mproved materials and methods for 

reaulatma the transition of a target gene. Aspects of the mvent.on are applicable >o 
systems Involving either covalent or non-covalent linking of the transcription activation 
domain to a DNA binding domain. 

Key features of the invent.on indude "bundling" domains, fus.on proteins containing 
« them. reoomb.nant nucleic acids encoding such fusion prote.ns. systems .nvolving bundles 



2 



PCT/L'S98'17-'23 

WO 99/10510 



ot sucn ius.on proteins, and other materials and methods involving such bundling domains. 
Key fusion proteins of the .nvention contain at least two mutually heterologous doma.ns. 
one of which being a bundling domain. An .mpohant design concept is tnat the fus.on 
proteins do not need to act alone. Instead, they find and b.nd to each other (or with other 

R proteins containing the bundling domain) to form a posse to accomplish their m.ss.on. In 
practice cells are engineered by the introduction of recombinant nucleic acds encooing the 
fus.on proteins, and in some cases with additional nucleic acid constructs, to render them 
capable of ligand-dependent regulation of transcription of a target gene. Administration of 
the hgand to the cells then regulates (positively, or in some cases, negatively) target gene 

10 transcription. 

Detailed information concerning bundling domains, guidance on their use and 
illustrative examples are provided below. Generally speaking, bundling domains include 
any domain that induces proteins that contain it to form multimers ("bundles") through 
protein-protein interactions with each other or with other proteins containing the bundling . 
,c domain Examples of bundling doma.ns that can be used in the practice of this invention 
include domains such as the lac repressor tetramerizafion domain, the p53 tetramenzat>on 
domain a leucine zipper domain, and domains derived therefrom which retain observable 
bundling activity. Proteins containing a bundling domain are capable of complexing with 
one another to form a bundle of the individual protein molecules. Such bundling is 
20 -constitutive- in the sense that it does not require the presence of a cross-link.ng agent 

(i.e.. a cross-linking agent wh,ch doesn't itself contain a proteinaceous bundling domain) to 
link the protein molecules. 

Illustrative (non-limiting) examples of heterologous doma.ns which can be included 
along with a bundling domain in vanous fus.on proteins of this invention include 
2 , transcription regulatory domains (i.e.. transcription activation domains such as a p65, VP16 
or AP domain; transcription potent.at.ng or synerg.zing domains; or transaction repress.on 
domains such as an ssn-6mJP-1 domain or Kruppel family suppressor domain); a DNA 
binding domain such as a GAL4. lex A or a composite DNA binding domain such as a 
composite zinc finger domain or a ZFHD1 domain; or a ligand-bind.ng domain compns.ng or 
derived from (a) an .mmunophilin. cyclophilin or FRB domain; (b) an antibiotic binding 
domain such as tetR: or (c) a hormone receptor such as a progesterone receptor or 
ecdvsone receptor. 

A w,de variety of lioand binding domains may be used in th.s invention, althoug. 
ligand binding domains which b,nd to a cell permeant ligano are preferred. It is also 
preferred that the Hgand have a molecular weight under about 5kD, more preferably below 
2 5 kD and optimally below aoout 1500 D. Non-proteinaceous ligands are also prelerred. 
Ligand binding doma.ns .nclude, for examole, doma.ns selected or derived from (a) an 
immunophil.n (e.g. FKBP 12), cydoph.iin or FRAP domain; (b) a hormone receptor such as 
a receptor for progesterone, eodysone or another steroid, and (c) an antibiotic receptor 
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such as a tetR domain lor binding to tetracycline, doxycycline or otner analogs or m.mics 

thereof. . 

Examples ol liaand bina.ng doma.n/ligand pairs that may be used in the practice ot 
this invent,on include, but are not limited to: FKBP:FK1012, FKBP:synthetic divalent FKBP 
ligands (see WO 95'0609 and WO 97/31898), FRB:rapamycin/FKBP (see e.g., WO 
96/41865 and Rivera e: a!, "A humanized system for pharmacologic control of gene 
exoress.on-. Nature Med;c;ne 2(9):1028- 10 32 (1997)), cyclophilinrcyclcsporin (see > e.g. 
WO 94/18317), DHFR.methotrexate (see e.g. Licitra et al, 1996. Proc. Natl. Acad. Sc. USA 
Q3-12317-12821) TetR tetracycline or doxycycline or other analogs or mimics tnereof 
lessen and Bu,ard. 1992, Proc. Natl. Acad. Sc. U.SA 89:5547; Gossen et al. 1995, 
Science 268:1766-1769: Kistner et al, 1996. Proc. Natl. Acad. Sci. USA 93:10933-10938). a 
progesterone receptor:RU486 (Wang et al, 1994. Proc. Natl. Acad. Sc.. USA 
9t -81 80-8184) ecdysone receotor.ecdysone or muristerone A or other analogs or m.m.cs 
tnereof fNo et al, 1996. Proc. Natl Acad.. Sci. USA 93:3346-3351) and DNA 
gyrase:coumermvcin .see e.g. Farrar et al. 1996, Nature 383:178-181). 

A wide variety of DNA binding domains may be used tn tine pract.ee of this 
.nvention including a domain selected or denved from a GAL4. lexA or composite (e.g 
ZFHD1) DNA binding domain, or a DNA bmd.ng domain, e.g.. in comb.nat.on wrth l.gand 
binding domains such as a wt or mutated progesterone receptor domain. TetR domains, 
which provide both DNA binding and Ugand binding functions, are discussed in the context 
of ligand binding domains. In many applications it is preferable to use a DNA b.nd.ng 
domain which .s heterologous to the cells to be eng.neered. Heterologous DNA b.nding 
domains include those which occur naturally in cell types other than the cells to be 
engineered as well as composite DNA binding domains containing component portions 
which are not found in the same continuous polypeptide or gene in nature, at least not .n 
the same order or onentatk>n or with the same spacing present in the composite doma.n. In 
the case of composite DNA binding domains, component peptide portons wh.ch are 
endogenous to the cells or organism to be eng.neered are generally preferred. 

in the case of the ch.meric transcription factors containing a tetR domain, tne DNA 
binding domain .s provided by the tetR component, and .s by its nature heterologous to 
eukaryotic cells. TetR doma.ns are discussed in further detail in the context of ligand bmo.ng 
domains. 

in embodiments in wh.ch an enoogenous gene is to be regulatably expressed, a 
compos.te DNA binding domain which .s selected for recogn.t.on of one cr more sequences 
35 upstream of the target gene may be deployed. 

Additional ,n»ormat,or. concerning DNA binding domains is provided below. 
In an .mportant appi.cation of this invent.on, Uvo or more of the fusion proteins >n the 
bundle each compnse, in add.tion to the bundl.ng domain, at least one transcription 
actrvat.cn domain which is heterologous to the bundl.ng domain. Bundling ol proteins 
40 conta.n.ng transcription activation domains can significantly increase their effective potency 
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(relative to a single such lusion protein lacking a bundling doma.n) and consequently leads 
to strong .nduction of gene expression. Unlike their counterparts lacking a bundl.ng doma.n. 
fusion proteins containing a bundling domain are designed to achieve effective local 
concentrations of transcription activation domains and to robustly induce gene express.on 
= when recruited en masse to an expression control sequence-even despite relatively low 
overall levels of expression or accumulation of the fusion proteins. Highly potent bundled 
activation doma.ns can also be used in a wide variety of assays hav.ng transcriptional 
read outs. Such assays include assays for identifying protein-protein interactions (or 
inhibitors thereof) in a eukaryotic, preferably mammalian, two-hybrid assay or variant 
o thereof, e.g., three-hybrid assay, reverse two-hybrid assay, etc. 

Bundling doma.ns may be introduced into the design of fusion proteins of a vanety 
of regulated gene expression systems, .ncluding both allostery-based systems such as 
those regulated by tetracycline, RU486 or ecdysone, or analogs or mimics thereof, and 
d.merizat.on-based systems such as those regulated by divalent compounos l.ke FK1012, . 
,5 FKCsA, rapamyein. AP1 510 or coumermycm, or analogs or mimics thereof, all as described 
below (See also. Clackson. 1997. Controlling mammalian gene expression with small 
molecules, Current Opinion in Chem. Biol. 1 210-218). The fusion proteins may comprise 
any combination of relevant components, including bundling domains, DNA bind.ng 
domains, transcription activation (or repression) domains and ligand binding ooma.ns. 
20 Other heterologous domains may also be included. 

Various embodiments of this invention involve fusion proteins which contain at 
least one bundling domain. DNA binding domain and transcription activation coma.n; at 
least one bundling doma.n. ligand binding domain and transection repression doma.n; at 
least one bundling doma.n. ligand binding domain and DNA binding doma.n; at least one 
2= bundling domain, liaand binding domain. DNA binding domain and transcription act.vat.on 
domain; and. preferably, at least one bundling domain, ligand binding domain and 
transcription activation domain. In currently preferred embodiments, these fusion prote.ns 
represent improvements on the type described in W094/18317 and W096/41865, 
wherein the ligand b.nd.ng domain is or is derived from a cyclophilin, immuncph.l.n (e.g. an 
30 FKBP domain) or FRB domain-although, any ligand bind.ng domain may be used in the 
chimeric proteins, and the regulatory mechanism can be d.menzation- or allostery-based. 

A preferred fusion protein contains a lac repressor tetramenzation domain, an FRB 
domain and a transcription activation domain denved from the activation oomain of human 
P 65 It should be appreciated that in any of the embodiments of this invention involving a 
35 fusion protein conta.n.ng at least one transcription actrvation doma.n denved from p65. 

whetner with or without a bundl.ng domain, the P 65 peptide sequence may be a naturally 
occurring p65 sequence or may be engineered as described below 

Another aspect of this invention involves improvements in the transcnption 
activation domain itself. In this regard, recombinant nucleic acids are provided which encode 
fusion proteins containing a transcription activation doma.n and at least one additional 
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comam that is heterologous thereto, where the peptiae sequence of the activat.on coma.n 
is itself modified relative to the naturally occurring sequence from which it was denved to 
increase or decrease its potency as a transcnpt.onal activator relative to the counterpart 
comprising the native peptide sequence. Certa.n embodiments of this invention involve 
fusion proteins containing a transcription activation domain derived from p65 and beanng 
one or more of the mutations shown in Figure 7. Fusion proteins containing one or more 
modified activation domains can also contain a bundling domain to further increase fhe.r 
efficacv as transcriptional activators, and/or one or more additional domains such as a 
ligand bmdmg domain. DNA bmo.ng domain or transition activation synergizing aoma.n. 
such as are noted above and as discussed below. 

The invention thus provides recombinant nucleic acid constructs which encode the 
various proteins of this invention or are otherwise useful for practicing it, various DNA 
vectors containing those constructs for use in transducing prokaryotic and eukaryotic cells, 
cells transduced with the recombinant nucleic acids, fusion proteins encoded by the above 
recombinant nucleic acids, and target gene constructs. 

Also provided are nucleic acid compositions compnsing two or more recombinant 
nucleic acids which, when present within a cell, permit transcnption of a target gene, 
preferably following exposure to a cell permeant ligand. These compositions are illustrated 
as follows: 

20 Composition #1 . A first such composition comprises a recombinant nucleic acid 

encoding a fusion protein compnsmg at least one ligand bind.ng domain, bundling domain 
and transcription activation domain; a second recombinant nucleic acid encoding a fusion 
protein compris.ng a DNA binding domain and at least one ligand binding coma.n; and an 
optional third recombinant nude.c acid compris.ng a target gene (or cloning site) operat,vely 
2= linked to an expression control sequence including a DNA sequence recognized by the 

DNA binding domain mentioned above. Such compositions are illustrated by embodiments 
in which the ligand bindinc domains are or are denved from immunophilin, cyciophil.n or FRB 
domains; the transcnption activat.on domain is or is denved from an activation domain sucn 
as a VP1 6 or p65 domain; and the bundling domain is or is derived from a lac repressor 

30 tetramerization domain. 

Composition #2. Another such composition is similar to Composition #1 except 
that the fusion protein encoded by the first recombinant nucleic acid comprises at least one 
ligand binding domain, bunding domain and DNA bind.ng doma.n. and the fus.cn protein 
encoded by the second recombinant nude.c ace composes a transcnpt.on activat.on 
2^ domain and at least one ligand binding coma.n. 

Composition #3. Another such composition comprises a recombinant nucleic aco 
encoding a fusion protein compnsing at least one ligand binding coma.n, bundling domain 
and transcnption activation domain; a secono recomb.nant nude.c acid encod.ng a prote.n 
compnsing a DNA binding doma.n; and an optional third recombinant nude.c acid compnsing 
40 a target oene (or doning site) operatively linked to an expression control sequence 
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including a DNA seauence recognized by the DNA binding domain mentioned above. Such 
compositions are illustrated by embodiments in which the ligand b.nd.ng domains are or are 
derived from a receptor domain such as an ecdysone receptor, the DNA binding oomain is 
or is derived from a DNA binding domain such as an RXR protein, chosen for its ability to 
5 bind to the receptor domain in the presence of a ligand for that receptor; the transcription 

activat.on doma.n is or is derived from an activation oomain such as a VP16 or P 6d comam. 
and the bundling domain is or is denved from a lac repressor tetramerization domain. 

Composition #4. Another such composition comprises a recombinant nucleic add 
encodina a fusion protein comorismg at least one ligand binding doma.n. DNA binding 
no domain bundling aomam and transcription activation domain (where the i.gand binding 

domain and DNA binding domain may be pan of or oerived from the same domain); and an 
optional second recombinant nucie.c acio comprising a target gene (or clon.ng site) 
operatively linked to an expression control sequence inctud.ng a DNA sequence 
recognized by the DNA binding comam mentioned above. Such compositions are 
n5 illustrated by embodiments in which the ligand binding and DNA binding domains are or are 
denved from a receptor domain such as a tetracycline receptor which is capable of bmd.ng 
to a characteristic DNA sequence in the presence of tetracycline or another ligand for the 
receptor; the transcription activation domain is or is denved trom an activation doma.n such 
as a VP16 or p65 domain; and the bundling domain is or is derived from a lac repressor 
2 o tetramerization domain. Such compositions are further illustrated by embodiments in which 
the ligand binding comam is or is derived from a receptor domain such as a progesterone 
receptor which is capable of binding to progesterone or analogs or mimics thereof, .nciud.ng 
RU486- the DNA binding doma.n is or is derived from a GAL4 or composite DNA b.nd.ng 
domain; the transition activation domain is or is derived from an activation doma.n such 
25 as a VP16 or p65 domain; and the bundling domain is or is derived from a lac repressor 
tetramerization domain. 

Composition #5. Anotner such composition, which unlike Compositions 1 - 4 is 
designed for constitutive expression rather than lor l.gand-mediated regulation of 
transition, compnses a recombinant nucleic acid encoding a fusion protein comprising at 
30 least one DNA binding domain, bundling domain and transcription activation doma.n; and a 
second recombinant nucleic acid comprising a target gene (or cloning site) operatively 
linked to an expression control sequence including a DNA sequence recognized by the 
DNA b.nd.ng domain mentioned above. Such compositions are illustrated by embodiments 
in which the transition activation oomain is or is derived from an activation domain such 
-5 as a VP 16 or p65 doma.n; the DNA binding doma.n is or is Oenved from a GAL4 or 
composite DNA bind.ng domain; and the bundi.ng domain is or is oenved from a lac 
repressor tetramenzation domain. 

Compositions 1 . 3. 4 and 5 may further compnse an additional recombinant nucleic 
acid encoding a fusion protein compns.ng a bundling domain and at least one transition 
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activation domain or transcription synergizing donnain. with or without one or more clonal 
additional domains. 

Each of the recombinant nucleic acids of this invention may further comprise an 
expression control sequence operably l.nked to the coding sequence and may be provioeo 
within a DNA vector, e.g., for use in transducing prokaryotic or eukaryot.c cells. Some or all 
of the recombinant nucleic acids of a given composition as described above, including any 
optional recombinant nucleic acids, may be present within a single vector or may be 
apportioned between two or more vectors. In certain embodiments, the vector or vectors 
are viral vectors useful for producing recombinant viruses containing one or more of the 
recombinant nucleic acids. The recombinant nucleic acids may be provided as inserts 
withm one or more recombinant viruses which may be used, for example, to transduce cells 
in vitro or cells present within an organism, including a human or non-human mammalian 
subject For example, the recombinant nucleic acids of any of Compositions 1 - 5, including 
any ootional recombinant nucleic acids, may be present within a single recombinant virus, 
o' within a set of recombinant viruses, each of which containing one or more of the set of 
recombinant nucleic acids. Viruses useful for such embodiments include any virus useful for 
gene transfer, including adenoviruses, adeno-associated viruses (AAV), retroviruses, 
hybrid adenovirus-AAV. herpes viruses, lent, viruses, etc. In specific embodiments, the 
recombinant nucleic acid compnsing the target gene is present in a first virus and one or 
more or tne recombinant nucleic acids encoding the transcnption regulatory protem(s) are 
present in one or more additional viruses. In such multiviral embodiments, a recombinant 
nucleic acid encoding a fusion protein comprising a bundling domain ana a transcnption 
activation domain, and optionally, a l.gand binding domain, may be provided in the same 
recombinant virus as the target gene construct, or alternatively, on a third virus. It should 
25 be appreciated that non-viral approaches (naked DNA, liposomes or other lipid 

compositions, etc.) may be used to deliver recombinant nucleic acids of this invention to 

cells in a recipient organism. 

The invention also provides methods for rendering a cell capable of regulated 
expression of a target gene which involves introducing into the cell one or more of the 
recombinant nucleic acids of this invention to yield engineered cells which can exoress the 
appropnate fusion prote.n(s) of this invention to regulate transcription of a target gene. The 
recombinant nucleic acid(s) may be introduced in virai or other form into cells maintained in 
vitro or into cells present within an. organism. The resultant engineered cells and the.r 
progeny containing one or more of these recombinant nucleic acids or nucleic acid 
3- compositions of this invention may be used in a vanety of important applications 

discussed elsewhere, including human gene therapy, analogous vetennary applications, 
the creation of cellular or animal models (including transgenic applications) and assay 
applications. Such cells are useful, for example, in methods mvolvtng the addition of a 
ligand preferably a cell permeant ligand, to the cells (or administration of the Ugand to an 
.o organism containing the cells) to regulate expression of a target gene. Particularly important 
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animal models include rodent (especially mouse and rat) and non-human pnmate models. In 
gene therapy applications, the cells will generally be human and the peptide sequence of 
ea-n of the vanous domains present in the fusion prote.ns (with the possible exception of 
the bundling domain) will preferably be, or be derived from, a peot.de sequence of human 

origin. . . 

In certain assay aophcations, recombinant nucleic acids are designed as oescnbed 

for Composition #1 , except that the ligand binding domains of the fusion proteins are 
replaced with protein domains that are known to b.nd to each other. Cells transduced w,th 
these recombinant nucleic acids and with a matched target gene construct express a target 
gene typically selected for convenience of measurement of expression level. These cells 
can be used to identify the presence of a substance which blocks the interaction of the two 
protein domains which are known to interact. 

In other 2-hybnd-tvpe applications aimed at the identification of genes encoding 
proteins which interact with a prote.n or protein domain of interest, cells are transduced with 
similar recombinant nucleic ac.ds as described immediately above, except that a library of 
test nucleic acid sequences of potential interest is cloned into one of the recombinant 
nucleic acids encoding one of the fusion proteins. A 2-hybnd style assay is conducted in 
which transcription of the target gene indicates the presence of a test nucleic and sequence 
which encodes a domain that interacts with the protein domain in the cognate fus.on 
20 protein. 

Reverse 2-hybrid-type assays may be conducted analogously using cells 
engineered to positively or negatively regulate expression of a reporter gene as a result of 
-2-hybricT formation. The cells are exposed to one or more test substances, and inhibition 
of regulation of expression is taken as an indication of possible inhibition of the 2-hybnd 
25 formation. 

Brief Description of the Figures 

Abbreviations used in the Figures: 
30 G = yeast GAL4 DN A binding domain, ammo acids 1 -94 
F = human FKBP12. amino acids 1-107 
r = FRB domain o! human FRAP, ammo acids 2C25-2113 

S = activation domain trom the p65 suDun.t of human NF-kB. ammo acids 361-550 
V = activation domain from Herpesvirus VP16. ammo acids 410-494 
35 L = E. coli lactose repressor, ammo acids 46-360 

MT = Minimal Tetramenzat.on (-bundling-) coma.n o! E. col, lactose repressor, amine acds 324- 

360 

FIG 1 Diagram comoanng vanous fus.on proteins, with and without bundling domains, and 
40 their use in vanous strategies tor delivery of act.vaton domains to the promoter of a target 
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gene (A) two fusion proteins, one containing a DNA binding domain (e.g. a GAL4 or 
ZFHD1 DNA bind.ng domain) fused to an FKBP12, and the other containing a D65 
act.vat.on doma.n fused to an FRB, are expressed in cells. Addition of rapamycin leads to 
the recruitment of a singe action doma.n to each DNA bind.ng aoma.n monomer. (B) 
Fusion of multiple FKEPs to the DNA binding domain allows rapamyan to recrurt multiple 
activation doma.ns to each DNA b.ndmg doma.n monomer. (C) Addition of the lactose 
repressor tetramer.zation doma.n to the FRB-activat.on doma.n fusion allows rapamyan to 
recruit four activation domains to each FKBP fused to the DNA binding doma.n (D) 
Rapamycin recruits bundled act.vat.on domain fusion prote.n to each of the FKBP-DNA 
binding doma.n fusion prote.ns. (E) and (F) illustrate a mutated tetR-based system, without 
and with bundling. (G) and (H) illustrate an engineered progesterone-R-baseo system, 
without and with bundling. 

FIG 2 Expression levels c« tne stably integrated reporter gene correlate with the number 
of action domains recru.teo to the promoter. The indicated DNA binding doma.n and 
activation domain fusions were transfected into HT1 080B cells conta.n.ng a stably 
integrated SEAP reporter. Mean values of SEAP activity secreted into the medium following 
addition of 10 nM rapamyc.n are shown (W- S.D.). In all cases. SEAP expression values 
are plotted for cultures receiving 100 ng of activation domain expression plasm.d, which 
gives peak expression values in transiently transfected cells and slightly below peak 
levels in the stably transfected cell line. 

FIG 3 Synergy between the activation domains in the RLS bundle is tne primary cause 
for the super-activat,on of the reporter gene expression, a) Schematic illustration of the 
composition of the prote.n bundles of RLS with .ncreasing concentration of co-expressed 
LS or L in the cell, b) Twenty nanograms of GF1 encoding plasmid was co-transfected with 
100 ng of R! S alone or with indicated concentrations of LS or L regions. The cells were 
stimulated with 10 nM rapamycin and the SEAP activity in the medium was measured 1 8 
hrs after transfection. Mean values of SEAP activity secreted into the medium following 
, addition of rapamycin are shown (W- S.D.). c) Western blot analyses using 12CA5 

antibody against hemagglutinin epitope of venous recombinant prote.ns expressed in the 
transfected cells is shown 

FIG 4 A thirty-six am.nc acid region in the carboxy terminal ot the lactose repressor 
protein is sufficient for generating highly potent and bundled activation doma.n fusion 
proteins HT1080 B ceils were co-transfected wrth 20 ng GF1 and 100 ng of indicated 
activation domain containing plasmid vectors. Transcription of the reporter gene was 
stimulated by the addition of 10 nM rapamyan in the med.um. Mean values of SEAP 
activity secreted into the med.um assayed 24 hrs after transiect.on are shown W- S.D.) 
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FIG- 5 Tethering bundled activation domain fusion proteins to DNA binding proteins 
significantly reduces the amount of reconstituted activators required to strongly stimulate 
the target gene expression, a) Twenty nanograms of GF4 and indicated concentrations of 
activation aomam expressing plasmids were transfected into HT1080 B cells. Transcnption 
of the stably integrated reporter gene was induced by the addition of 10 nM rapamycin in 
the medium, b) Western blot analysis of the relative expression levels of the transtected 
transcnption factors, c) Twenty nanograms of GF4 and one hundred nanograms of the 
indicated activation domain fusion protein encoding plasmids were co-transfected into 
HT1080 B cells and the transcriptional activity of the GAL4 responsive reporter gene was 
induced by the addition of indicated concentrations of rapamycin in the medium. In all 
cases, mean values of SEAP activity secreted into the medium 24 hrs after the addition of 
rapamycin are shown (+/- S.D.). 

FIG. 6 Bundling the target-activation domain fusion protein improves the sensitivity of the 
two-hybrid assay in mammalian cells. Diagram showing two-hybrid assay using bundlec 
fusion protein containing the target and activation domains. GAL4 DNA binding domain 
fused to c-Cbl (GCBL) is shown interacting with its target protein SH3 fused to either a) 
VP1 6 activation domain (SH3S) or b) lactose repressor tetramerization domain-VP1 6 
activation domain sequences (SH3MTS). c) HT1080 B cells containing stably integrateo 
GAL4 responsive reporter gene were transfected with 100 ng of indicated expression 
plasmids. Mean values of SEAP activity secreted into the medium 24 hrs after transferor 
are shown (+/- S.D.). 

PIG. 7 Mutations for the p6S transcnption activation domain are listed, including: 

1 . Mutations that are intended to increase activation potency, including M1 , M2, M6.M7 
and M8. 

2. Mutations that are intended to slightly decrease activation potency, including M4 anc 
M5. 



Detailed Description of the Invention 
Definitions 

For convenience, the intended meaning of certain terms and phrases usee herein 

are provided below. 

"Activate" as appliec to the expression or transcription of a gene denotes a 
directly or indirectly observable increase in tne oroduction of a gene oroduct, e.g., an RNA 
or polypeptide encoded by the gene. 

"Capable of selectively hybridizing" means that two DNA molecules are 
susceptible to hybridization with one another, despite the presence of other DNA 
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nx)lecules, under hybridization conditions which can be chosen or readily determined 
empirically by the practitioner of ordinary skill in this art. Such treatments indude conditions 
ot high stringency such as washing extensively with buffers containing 0.2 to 6 x SSC, 
and'or containing 0.1 % to 1% SDS, at temperatures ranging from room temperature to 65- 
75°C. See for example F.M. Ausubel et al., Eds. Short Protocols in Molecular Biology, 
Units 6.3 and 6.4 (John Wiley and Sons, New York, 3d Edition. 1995). 

-Cells", "host cells" or "recombinant host cells" refer not only to the particular 
cells under discussion, but also to their progeny or potential progeny. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still 
mcluoeo within the scope of the term as used herein. 

"Cell line" refers to a population of cells capable of continuous or prolonged 
growth and division in vitro. Often, cell lines are clonal populations denved from a single 
progenitor cell. It is furtner known in the art that spontaneous or induced changes can occur 
in karyotype during storage or transfer of such clonal populations. Therefore, cells derived 
from the cell line referred to may not be precisely identical to the ancestral cells or cultures, 
and the cell line referred to includes such variants. 

"Composite", "fusion", and "recombinant" denote a material such as a nucleic 
acid nucleic acid sequence or polypeptide which contains at least two constituent portions 
which are mutually heterologous in the sense that they are not otherwise found directly 
(covalently) linked in nature, i.e.. are not found in the same continuous polypeptide or gene 
in nature, at least not in the same order or orientation or with the same spacing present in 
the composite, fusion or recombinant product. Typically, such materials contain 
components derived from at least two different proteins or genes or from at least two non- 
adjacent portions of the same protein or gene. In general, -composite" refers to portions of 
different proteins or nucfe.c acids which are joined together to form a single functional unit, 
while "fusion" generally refers to two or more functional units which are linked together. 
"Recombinant" is generally used in the context of nucleic acids or nucle.c acid sequences. 

"Cofactor refers to proteins which either enhance or repress transcnption in a 
non-gene specific manner. Cefaclors typically lack intrinsic DNA binding specificity, and 
function as general effectors. Positively acting cofactors do not stimulate basal 
transcription, but enhance the response to an activator. Positively acting cofactors include 
PC1 , PC2. PC3, PC4. and ACF. TAFs which interact directly with transcriptional activators 
are also referred to as cofactors. 

A "coding sequence" or a sequence wh.ch "encodes' a particular polypeptide or 
RNA. is a nucleic acid sequence which is transenbed (in the case of DNA) and translated 
(in the case of mRNA) into a polypeptide in vitro or in v.vo when placed under the control 
of an appropnate expression control sequence. The boundaries of the coding sequence 
are generally determined by a start cooon at the 5' (amino) terrrunus and a translation stop 
, codon at the 3 1 (carboxy) terminus. A coding sequence can include, but is not limited to. 
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cDNA from procaryotic or eukaryotic mRNA, genomic DNA sequences from procaryotic or 
eukaryot.c DNA, and synthet.c DNA sequences. A transcription termination sequence will 
usually be located 3' to the coding sequence. 

The term "conjoint", with respect to administration of two or more viruses, refers to 
the simultaneous, sequential or separate 'dosing of the individual virus provided that some 
overlap occurs in tne simultaneous presence of the viruses in one or more cells of the 
animal. 

A "construct", e.g., a "nucleic acid construct" or "DNA construct", refers to a 

nucleic acid or nucleic acid sequence. 

"Derived from" denotes a peptide or nucleotide sequence selected from within a 
given sequence. A peot.de or nucleotide sequence denved from a named sequence may 
further contain a small number cf modifications relative to the parent sequence, in most 
cases representing deletion, replacement or insertion of less than about 1 5%. preferably 
less than about 1 0%, and in many cases less than about 5%, of am.no acid residues or . 
bases present in the parent sequence. In tne case of DMAs, one DNA molecule .s also 
considered to be derived from another if the two are capable of selectively hybridizing to 
one another. Polypeotides or polypeptide sequences are also considered to be denved 
from a reference polypeptide or polypeptide sequence if any DNAs encoding the two 
polypeptides or sequences are caoable of selectively hybridizing to one another. 
Typically a derived pept.oe sequence will differ from a parent sequence by the 
replacement of up to 5 ammo acids, in many cases up to 3 amino acids, and very often by 
0 or 1 amino acids. A denved nucle.c acd sequence will differ from a parent sequence by 
the replacement of uo to 1 5 bases, in many cases up to 9 bases, and very often by 0 - 3 
bases. In some cases the amino acid(s) or base(s) is/are added or deleted rather than 
25 replaced. 

"Domain" refers to a portion of a protein or polypeptide. In the art. the term 
"domain" may refer tc a portion of a protein havtng a discrete secondary structure. 
However, as will be apparent from the context used herein, the term "domain- as used in 
this document does not necessany connote a given secondary structure. Rather, a 
peptide sequence is referred to herein as a "domain" simply to denote a polypeptide 
sequence from a defined source, or having or conferring an intended or observed activity. 
Domains can be derived from naturally occurring proteins or may compnse non-naturally- 

occurnng sequence. 

"DNA recognition sequence" means a DNA sequence which is capable ot 
binding to one or more DNA-b,nd,ng domains, e.g., of a transcription factor or an engineered 
polypeptide. 

"Expression control element", or s,mply -control element", refers to DNA 
sequences, such as initiation s.gnals. enhancers, promoters and silencers, wn.ch .nduce or 
control transcnption of DNA sequences with which they are operabiy linked. Control 
elements ot a gene may be located in introns, exons, coding regions, and 3" flanking 
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seouences. Some control elements are "tissue specific", i.e., affect express.on of the 
selected DNA sequence preferentially in specific cells (e.g., cells of a specific tissue), wh.le 
others are active in many or most cell types. Gene express.on occurs preferent,ally .n a 
specific cell if express-on in this cell type is observably higher than expression in other eel! 
types. Control elements include so-called "leaky" promoters, which regulate expression of 
a selected DNA primarily in one t.ssue, but cause expression in other tissues as well. 
Furthermore, a control element can act constitufively or induably. An .nducible promoter, for 
example is demonstrably more active in response to a stimulus than in the absence of that 
stimulus. A st.mulus can comprise a hormone, cytokine, heavy metal, phorbol ester, cyce 
AMP (CAMP), ret.noc ac.d or derivative thereof, etc. A nucieot.de sequence containing one 
or more expression control elements may be referred to as an "expression control 
sequence". 

-Gene" refers to a nucleic acid molecule or sequence compnsing an open read.ng 
frame and including at least one exon and (optionally) one or more intron sequences. 

"Genetically engineered cells" denotes cells which have been modified by the 
introduction of recombinant or heterologous nucleic acids (e.g. one or more DNA constructs 
or their RNA counterparts) and further includes the progeny of such cells which reta.n part 
or all of such genetic modification. 

"Heterologous", as it relates to nucleic acid or peptide sequences, denotes 
sequences that are not normally joined together, and/or are not normally associated with a 
particular cell. Thus, a "heterologous" region of a nucleic acid construct is a segment of 
nucleic acid within or attached to another nucleic acid molecule that is not found .n 
association with the other molecule in nature. For example, a heterologous region of a 
construct could include a coding sequence flanked by sequences not found in assoaat.on 
with the coding seouence in nature. Another example of a heterologous cod.ng sequence 
is a construct where the coding sequence itself is not found in nature (e.g., synthetic 
sequences having codons different from the native gene). Similarly, in the case of a cell 
transduced with a nucle.c acid construct which is not normally present in the cell, the cell 
and the construct would be considered mutually heterologous for purposes of this 
invention. Allel.c vanation or naturally occurring mutational events do not give nse to 
heterologous DNA, as used herein. 

"Initiator" refers to a short, weakly conserved element that encompasses the 
transcr,pt,on star, site and which is .mportant for directing the syntnes.s of properly .n.t.ated 
transenpts. 

"Interact" refers to directly or indirectly detectable interactions between molecules, 
such as can be detected using, for example, a yeast two hybrid assay or by 
.mmunopreciprtation. The term ".nteract" encompasses "binding" interactions between 
molecules. Interactions may be, for example, prote.n-prote.n, protein-nucleic ac.d. prote.n- 
small molecule or small molecule-nucleic acid in nature. 
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••Minimal promoter" refers to the minimal express.cn control element that is 
capable of initiating transcription of a selected DNA sequence to which it is ooerably Imkea 
A minimal promoter frequently cons.sts of a TATA box or TATA-like box. Numerous min.mal 
promoter sequences are known in the literature. 

"Nucleic acid" refers to polynucleotides such as deoxyribonucleic acid (DNA), 
and where appropnate, ribonucleic acid (RNA). The term should also be understood to 
include derivatives, vanants and analogs of either RNA or DNA made from nucleotide 
analogs, and. as applicable to the embodiment be.ng described, single (sense or 
antisense) and double-stranded polynucleotides. 

"Operably linked" when referring to an expression control element and a coding 
sequence means that the expression control element is associated with the coo.ng 
sequence in such a manner as to permit or facilitate transcription of the coding sequence. 

A "recombinant virus" is a vims particle in which the packaged nucleic acid 
contains a heterologous portion. 

"Protein", "polypeptide" and "peptide" are used interchangeably. 
A -target gene" is a nucleic acid of interest, the expression of which is modulated 
according to the methods of the invention. The target gene can be endogenous or 
exogenous and can integrate into a cell's genome, or remain episomal. The target gene can 
encode, for instance, a protein, an antisense RNA or a ribozyme. 

The terms "transcriptional activation unit" and "activation unit", refer to a 
peptide sequence which is capable of inducing or otherw.se potentiating transition 
activator^iependent transcnption, either on its own or when linked covalently or non- 
covalently to another transcriptional activation unit. An activation unit may contain a m.n.mal 
polypeptide sequence which retains the ability to interact directly or indirectly with a 
transcnption factor. Unless otherwise dear from the context where a «us,on protein » 
referred to as "including" or "comprising" an activation unit, it will be understood that other 
portions of the protein from which the activation unit is derived can be included. 
Transcnptional activation units can be rich in certain amino acids. For example, a 
transcriptional actuation unit can be a peptide nch in acidic residues, glutamine. proline, or 
serine and threonine residues. Other transcnptional activators can be nch in .soleucine or 
basic amino acid residues (see, e.g.. Triezenberg (1995) Cur. Opin. Gen. Develop 5:190. 
and reference cited therein). For instance, an activation unit can oe a peptide motif of at 
least about 6 amino acid residues assoc.ated with a transcription activation domain, 
including the well-known "acidic", "glutamine-nch" and "proime-nch" motifs sucn as tne K13 
motif from p65. the OCT2 Q domain and the OCT2 P domain, respectively. 

The term "transcriptional activator" refers to a protein or protein complex, tne 
presence of which can increase the level of gene transcription in a ceil of a responsive 
gene It is thought that a transcnptional activator is capable of enhancing the efficiency 
with which the basal transcription complex performs, i.e., activating transcnption. Thus, as 
, used herein, a transcriptional actvator can be a single protein or alternatively rt can be 
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composed of several units at least some of which are not covalently linked to each other. 
A transcriptional activator typically has a modular structure, i.e., compnses one or more 
component domains, such as a DNA binding domain and one or more transcnptional 
activation units or domains. Transcnpt.cnal activators are a subset of transcr, P t;on factors, 
defined below. 

'Transcription factor" refers to any protein whose presence or absence 
contributes to the initiation of transcription but which is not itself a part of the polymerase. 
Certain transcnption factors stimulate transcnpt.on ("transcriptional activators'); other 
reoress transcnption ("transcriptional repressors"). Transcription factors are generally 
classifiable into two groups: («) the general transcnption factors, and (ii) the transcnpt.on 
activators Transcription factcrs usually contain one or more regulatory domains. Some 
transcnption factors contain a DNA binding domain, which is that part of the transcription 
factor which directly interacts with the expression control element of the target gene. 

"Transcription regulatory domain" denotes any domain which regulates 
transcription, and includes activation, synergizing and repression domains. The term 
"activation domain" denotes a domain, e.g. in a transcnption factor, which positively 
reaulates (increases) the rate of gene transcription. The term "repression domain" denotes 
a domain which negatively regulates (inhibits or decreases) the rate of gene transcription. 

A " transcription synergizing domain" is defined as any domain which 
increases the potency of transcriptional activation when present along with the 
transcription activation domain. A synergizing domain can be an independent 
transcriptional activator, or alternatively, a domain which on its own does not induce (or 
does not usually induce) transcnption but is able to potentiate the activity of a transcnpt.on 
activation domain. The synergizing doma.n can be a component domain of a fus.on prote.n 
containing the activation domain or can be recruited to the DNA binding domain or other 
component of the transcnption complex, e.g., via a bundling interaction. 

Transfection" means the .ntroduct.on of a naked nucle.c acid molecule into a 
recipient cell. "Infection" refers to the process wherein a nucleic acid is introduced .nto a 
cell by a virus containing that nucle.c acid. A 'productive infection' refers to the process 
wherein a virus enters the cell, is replicated, and is then released from the cell (sometimes 
referred to as a 'lytic' infection). "Transduction" encompasses the introduction of nucleic 
acid into cells by any means. 

-Transgene" refers to a nucleic acia sequence which has been introauced into a 
cell. Daughter cells denv.ng from a cell in which a transgene has been introduced are also 
, said to contain the transgene (unless it has been deleted). The polypeptide or RNA 

encoded by a transgene may be partly or entirely heterologous, i.e.. foreign, with respect 
to the animal or cell into which it is introduced. Alternatively, the transgene can be 
homologous to an endogenous gene of the transgenic animal or cell into which it is 
introduced, but is designed to be inserted, or is inserted, .nto the an.mafs genome in such a 
o way as to alter the genome of the cell into which it is inserted (e.g.. it is inserted at a 
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location which differs from that of the natural gene). A transgene can also be present in an 
episome. A transgene can include one or more expression control elements and any other 
nucleic acid, (e.g. intron}, that may be necessary or desirable for optimal expression of a 
selected coding sequence. 

The term "vector" refers to a nucleic acid molecule capable of transporting anotner 
nucleic acid to which it has been linked. One type of vector is an episome, i.e., a nucleic 
acid capable of extra^hromosomal replication. Often vectors are used which are caoabte 
of autonomous replication and/or exoress.on of nucleic acids to which they are linked. 
Vectors capable of directing tne expression of an included gene operatively hnxed to an 
expression control sequence can be referred to as "expression vectors". Expression 
vectors are typically in tne form of "plasmids" which refer generally to circular double 
stranded DNA loops which, in their vector form are not bound to the chromosome. In the 
present specification, "piasmid" and "vector' are used interchangeably as the plasmid is 
the most commonly used form of vector. However, the invention is intended to include such 
other forms of vectors which serve equivalent functions and which are or become known in 
the art. Viral vectors are nucleic acid molecules containing viral sequences which can be 
packaged into viral particles. 
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Bundling domains 

As described above, bundling domains interact with like domains via protein-protein 
interactions to induce formation of protein "bundles". Various order oligomers (dimers, 
tnmers, tertramers, etc.) of proteins containing a bundling domain can be formed, depending 
on the choice of bundling oomain. 

One example of a dimenzation domain is the leucine zipper (LZ) element. Leucine 
zippers have been identified, generally, as stretches of about 35 amino acids containing 4- 
5 leucine residues separated from each other by six amino acids (Maniatis and Abel (1989) 
Nature 341 :24-25). Exemplary leucine zippers occur in a vanety of eukaryotic DNA 
binding proteins, such as GCN4, C/EBP, c-Fos, c-Jun, c-Myc and c-Max. Other 
dimenzation domains include hehx-looD-helix domains (Mum?, C. et a!. (1989) Cell 55:537- 
544). Dimenzation domains may also be selected from other proteins, such as tne retinc;c 
acid receptor, the thyroid ho-mone receptor or other nuclear hormone receptors (Kurokawa 
et al. (1993) Genes Dev. 7:1423-1435) or from the yeast transcription factors GAL4 and 
HAP1 (Marmonstein et al. (1992^ Nature 356:40^414; Zhang et al. (1993) Prcc. Natl. Acad. 
Sci. USA 90:2851-2855). Dimenzation domains are further described in U.S. Patent No. 
5,624,818 by Eisenman. 
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Of particular current interest are tetramer-forming bundling domains. Incorporation of 
such a tetramerization domain within a fusion protein leads to the const.tut.ve assembly of 
tetrameric clusters orbundles. For example, a bundle of tour activation units can be 
assembled by covalentiy linking the activation unit to a tetramerization domain. By 
clustering the activation units together through a bundling domain, four activation units can 
be delivered to a single DNA bind.ng domain at the promoter. The E. coli lactose repressor 
tetramerization domain (amino acids 46-360; Chakenan et al. (1 991 ) J. Biol. Chem. 
266:137: . Albert, et al. (1993) EM BO J. 12:3227; and Lewis et al. (1996) Nature 
271 :1247). illustrates this class. Furthermore, since the fusion proteins may contain more 
tnan one activation unit unKed to the bundling domain, each of the four proteins of the 
tetramer can conta.n mere than one activation unit (and the complex may comprise more 
than 4 activation units). 

Other illustrative tetramerization domains include those derived from residues 322- 
255 of p53 (Wang et al. (1994) Mol. Cell. Biol. 14:5182; Clore et al. (1994) Science 
265:386) see also U.S. Pat. No. 5.573,925 by Halazonetis. Other bundling domains can 
be derived from the D.merizat.on cefaclor of hepatocyte nuclear factor-1 (DCoH). DCoH 
associates with specific DNA binding proteins and also catalyses the dehydration of the 
biopterin cofactor of phenylalanine hydroxylase. DCoH is a tetramer. See e.g. Endrizzi, 
J A. Cronk, J.D.. Wang, W., Crabtree, G.R and Alber, T. (1995) Science 268, 556559; 
Suck and Ficner (1 995) FEBS Lett 389(1 ):3-39; Standmann, Senkel and Ryffel (1998) Int J 

Dev Biol 42(1):53-59 

The bundling domain may comprise a naturally-occurring peptide sequence or a 
modified or artificial pect.de sequence. Sequence modifications in the bundling domain may 
be used to increase the stability of bundle formation or to help avoid unintended bundling 
with native protein molecules in the engineered cells which conta.n a wild-type bundling 
domain. 

For example, sequence substitutions that stabilize oligomenzation driven by leucine 
zippers are known (Krylov et al. (1994) cited above; O'Shea et ai. (1992) cited above). 
To illustrate, residues 1 74 or 175 of human p53 may be replaced by glutamine or leucine, 
respectively. 

To illustrate sequence modifications aimed at avoiding unintended bundling with 
endogenous protein molecules, the p53 tetramenzation domain may be modified to recuce 
the likelihood of bundling witn endogenous p53 proteins that have a wild-type p53 
tetramerization domain, such as wild-type p53 or tumor-derived p53 mutants. Such 
altered p53 tetramenzation domains are described in U.S. Pat. No. 5.573.925 by 
Halazonetis and are characterized by disruption of the native P 53 tetramenzation domain 
and insertion of a heterologous bundling domain in a way that preserves tetrameriza'acn. 
Disruption of the p53 tetramenzation domain involving residues 335-348, or a subset of 
these residues, sufficiently dismpts the function of this domain so that it can no longer drive 
tetramerization with wild-type p53 or tumor^erived p53 mutants. At the same time, 
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however introduction of a heterologous dimerization domain reestablishes the ability to 
form tetramers. which is mediated both by tne heterologous dimerization domain and by the 
residual portion of the p53 tetramenzation aomain sequence. 

Other suitable bundling domains can be readily selected or designed by the 
practitioner including semi-artificial bundling doma.ns. such as vanants of the GCN4 leucine 
rpper that form tetramers (Albert, et al. (1993) EMBO J. 12:3227-3236; Harburyetal. 
(1993) Science 262:1401-1407; Krylov et al. (1994) (1994) EMBO J. 13:2849-2861). The 
tetrameric variant of GCN4 leucine zipper descr.bed in Harbury et al. (1 993). supra, has 
.soleucines at positions d of the coiled coil and leucines at positions a, in contrast to the 
original zipper which has leucines and valines, respectively. 

The choice of bundling domain can be based, at least in part, on the desired 
conformation of the bundles For instance, the GCN4 leucine zipper drives parallel subuntt 
assembly [Harbury et al. (1 993), cited above], while the native p53 tetramer.zauon 
domain drives antiparalle' assembly [Core et al. (1994) cited above; Sakamoto et al. 
(1994) Proc. Natl. Acad. Sa. USA 91:8974-89781. 

In addition, a variety of tecnmques are available for identifying other naturally 
occurring bundling domains, as well as tor selecting bundling domains derived from mutant 
or otherwise artificial sequences. See. tor example. Zeng et al. (1997) Gene 185245; 
O'Shea et al. (1992) Cell 68:699-708: Krylov et al. [cited above]. 

In applications of the invention involving the genetic engineering of cells within (or 
for use within) whole animals, tne use of peptide sequence derived from that species is 
preferred when possible. For instance, for applications involving human gene therapy, use 
of bundling domains denved from human proteins may minimize the risk of immunogenic 
reactions. However, in some cases the use of bundling domains of human origin may 
induce interactions between tne fusion proteins and the endogenous protein from which the 
bundling domain was denved, i.e.. leading to unwanted bundling of fusion prote.ns with the 
endogenous protein containing the ident.cal bundling domain. Such interactions, in addrton 
to inhibit.ng target gene expression, may also have other aoverse effects .n the cell, e.g.. 
by interfenng with the functon of the endogenous protein from which the bundling doma.n 
30 was derived. 

Approaches for avoiding unwanted bundling of fusion proteins of this invention 
with endogenous proteins include using a bundling domain which is (a) heterologous to the 
host organism, (b) expressed by the host organism but only (or oredom.nantly) in cells or 
tissues other than those which will express the fusion proteins, or (c) engineered through 
35 modification in peptioe sequence such tnat it bundles preferentially with itself rather tnan 
with an endogenous bundling aomain. 

The first approach is illustrated by the use of a bacterial lac repressor 
tetramenzation domain tn human ceils. 

The second approach requires the use of a bundling domain denved from a protein 
40 which is not expressed in the cells or t.ssues which are to be engineered to express the 
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fusion protein(s) of this invention, at least not at a level wh.ch would cause undue 
interference with the bundling application or with normal cell function. Fusion prote.ns 
containing a bundl.nc domain derived from an endogenous protein exoressed selectively or 
preferentially in one tissue could be expressed in a different tissue without any aoverse 
effects For example, tc regulate gene expression in human muscle, fus.on proteins 
containing bundling domains from a protein expressed in liver, bram or some other tissue or 
tissues-but not in muscle-can be expressed in muscle cells w.tnout uncue nsk of 
mismatched bundling. 

In the third approach, and as noted previously, the binding specificity of the 
bundling domain .s enomeerec by alterations in peptide sequence to replace (.n whole or 
part, bundling activity for proteins containing the wild-type bundling domain with bundling 
activity for proteins containing the modified peptide sequence. 

Several examples of tissue-specific bundling domains which could be used in the 
practice of this invention include bundling domains derived lrom the Retinoid X receptor, 
(Kersten S Reczek, P.R and N. Noy (1997) J. Biol. Chem. 272, 29759-29768); 
Dopamine D3 receptor (N.mchinsky, EA, Hof. P.R.. Janssen, W.G.M.. Morrison. J.H and 
C Schmauss (1997) J. Biol. Chem. 272. 29229-29237); Butyrylcholinesterase (Blong, 
RM Bedows, E and O. Lockridge (1997) Biochem. J. 327, 747-757); Tyrosine 
Hydroxylase (Goodwill, K.E., Sabatier, C. Marks. C. Raag, R.. Frtzpatrick, P.F and R.C. 
Stevens (1997) Nat. Struct. Bio! 7, 578-585). Bcr (McWhirter, J.R., Galassc, D.L and J.Y. 
Wang (1993) Mol. Cell. Biol. 13. 7587-7595); and Apolipoprote.n E (Westerlund, J.A and 
K.H. Weisgraber (1993) J. Biol. Chem. 268. 15745-15750). 

Transcription Activation Domains / Activation Units 

Transcription activation domains and activation units can compnse naturally- 
occurring or non-naturally-occurnng peptide sequence so long as they are capable of 
activating or potentiating transcnption of a target gene construct. A vanety of polypeptides 
and polypeptide sequences which can activate or potentiate transcnption ,n eukaryotic 
cells are known and in many cases have been shown to retain their activate function 
when expressed as a component of a fusion protein. An activation unit is generally at 
least 6 amino ac.ds, and preferably contains no more than about 300 ammo ac.d res.dues. 
more preferably less than 200. or even less than 100 residues. 

Naturally occumng activation units include portions of transcnpt.on factors, sucn as 
a thirty am.no acid sequence from the C-terminus of VP16 (ammo ac.ds 461 -490). referred 
to herein as "Vc". Other activation units are derived from naturally occurring peptides For 
example the replacement of one amino ac.d of a naturally occumng activation unit by 
another may further increase activation. An example of such an activat.on unit is a 
der.vat.ve of an e.ght am.no ac.d peptide of VP 16. the derivative hav,ng the am.no acid 
sequence DFDLDMLG. Other activation units are "synthetic" or -artificial" in that they are 
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not denved Irom a naturally occurnng sequence. It is known, for example, that certain 
ranaom alignments of acidic am.no aads are capable of activating transcr.pt.on. 

Certam transcription factors are known to be active only in specific cell types, i.e., 
•hey activate transcription ,n a tissue specific manner. By using activation units wh.ch 
; u nct;on select.vely or preferentially in specific cells, it is possible to design a transcnpt.ona. 
activator of the invention having a desired tissue specificity. 

One source of peptide sequence for use in a fusion protein of this invent.on ,s the 
herpes simplex virus virion protein 16 (referred to herein as VP16. the ammo acid sequence 
ol which ,s aisciosed in Tnezenberg. S.J. et a!. (1988) Genes Dev. 2:718-729). For 
example an activation unit corresponding to about 127 of the C-term.nal am.no acids of 
VP1 6 can be used Alternatively, at least one copy of about 1 1 ammo acids from the C- 
termmai region of VP16 wh,ch retains transcnption act,vat,on ability is used as an activation 
unit. Preferably, an oligomer comprising two or more copies of this sequence « used 
Suitable C-terminal peotide portions of VP1 6 include those descnbed .n Se.pel. K. et al. 

is (EMBO J. (1992) 13:4961-4968). 

Another example cf an acidic activation unit is provided in rescues 753-881 of 

GAL4 ' One particularly important source of transcription activation units is the (human) NF- 
kB subunit p65. The activation domain may conta.n one or more copies of a pept.de 
20 seauence comprising all or pari of the p65 sequence spanning res.Oues 450-550. or a 
peptide sequence denved therefrom. In certain embodiments, it has been found that 
extending the P 65 peptide sequence to include sequence spanning p65 res.dues 361 -450. 
e.g., including the "AP activation unir. leads to an unexpected increase in transcr.p.on 
activation. Moreover, a peptide sequence comprising ail or a port.cn of p65(361 -550). or 
25 peptide sequence denved tnerefrom, in combination with heterologous activation units, can 
vield surprising additional increases .n the level of transcription activation. p65-based 
activation domains function across a broad range of promoters and in a numoer of bundlmg 
experiments have yielded increases in transcnption levels of chromosoma.lv incorporated 
target genes six-fold, eight-told and even 14-15-fold h.gher than obtained with unbundled 
30 tandem copies of VP16 which itself is widely recognized as a very potent activation 

domain. , K 

It >s expected that reccmomant DNA molecules encoding fusion prote.ns wh.ch 
contain a o65 activation unit, or pept.de sequence denved therefrom, will provKte significant 
aovantages for heterologous gene expression in ,ts venous contexts, inducing d.menzat.on 
3S based regulated systems such as descnbed in Internationa! patent appt.cat.ons 

PCT/US94/01617. PCT/US95/10591 , PCT/US96/09948 and the l.ke. as well as in other 
heterologous transcnption systems including allostery-based regulation such as those 
,nvolv,ng tetracyd.ne-b.sed regulation reported by Bu.aro et al. and those mvo.v.ng 
steroid or other hormone-based regulation. 
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One class of p65-based transcription factors contain more than one copy of a p65- 
denved domain. Sucn proteins will typically contain two or more, generally up to about six. 
cooies of a peptiae sequence comprising all or a portion of p65(361-550). or peptide 
sequence derived therefrom. Such iterated p65-based transcription activation domains are 
useful both in bundled and non-bundled approaches. 

Other polypeptides with transcription activation activity in eukaryotic cells can be 
used to provide activation units for the fusion proteins of this invention. Transcription 
activation domains found within various proteins have been grouped into categories based 
upon shared structural features. Types of transcription activation domains include acidic 
transcription activation domains (noted previously), proline-r.ch transcription activation 
domains, serineAhreonine-rich transcription activation domains and glutamme-nch 
transcription activation coma.ns. Examples of prolme-rich activation doma.ns include ammo 
acid residues 399-499 of CTF/NF1 ano ammo acid residues 31-76 of AP2. Examples of 
senneAhreomne-rich transcription activation doma.ns incfude amino acid residues 1^27 of 
5 ITF1 and amino acid residues 2-451 of ITF2. Examples of glutamine-rich activation 

domains include ammo acid residues 1 75-269 of Oct1 and amino acid residues 1 32-243 of 
Sp1 . The amino acid sequences of each of the above described regions, and of other 
useful transcription activation domains, are disclosed in Seipel, K. et al. (EMBO J. (1 992) 
13:4961-4968). 

•o Still other illustrative activation domains and motifs of human ongin include the 

activation domain of human CTF, the 1 8 amino acid (NFLQLPQQTQGALLTSQP) 
glutamine rich region of Oct-2. the N-terminal 72 amino acids of p53. the SYGQQS repeat 
in Ewing sarcoma gene and an 1 1 ammo acid (535-545) acidic rich region of Rel A protein. 
In addition to previously aescribed transcription activation domains, novel 
25 transcription activation units, which can be identified by standard techniques, are within tne 
scope of the invention. The transcription activation ability of a polypeptide can be 
assayed by linking the polypeptide to a DNA binding domainand determining the amount 
of transcription of a target sequence that is stimulated by the fusion protein. For example, 
a standard assay used in the art utilizes a fusion protein of a putative activation unit and a 
30 GAL4 DMA binding domain (e.g., amino acid residues 1 -93). This fusion protein is then 
used to stimulate expression of a reporter gene linked to GAL4 binding sites (see e.g., 
Seipel, K. et al. (1992) EMBO J. 1 1:4961-4968 and references cited therein). 

The activation comains of the invention can be from any eukaryotic species 
(including but net limited to various yeast spec.es and var.ous vertebrate species, 
including the mammals), and it is not necessary that every activation unit or domain be from 
the same species. In applications of this invention to whole organisms, it is often 
preferable to use activation units and activation domains from tne same soec.es as the 
recipient to avoid immune reactions against the fusion proteins. 
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Mutations in the Activation Domain 

One way to increase the potency of an activation oomain is to increase its ac.o.c or 
hydroDhobic content through modifications in pept.de sequence. Acidic ammo acos which 
can increase potency of activation domains include aspartic acid ana glutamic acid. In some 
cases one mav want to dPcrease (usually only modestly) the potency of the activation 
domain in order to obtain a less steep activat.on curve, especially if a greater number of 
individually weaker activation domains will be deployed together, e.g.. by bundling. 

Thus, in one embodiment of this invention, mutations are introduced into the 
activation oomain by standard techniques known in the art. such as site-erected PCR 
based mutagenesis. In this embodiment, one to five, in some cases one to three, 
alterations in peptide sequence can be introduced into the DNA cod.ng for the activation 
oomain. Each of these mutations eitner alone or .n combination with one or more other 
mutations may be readily assayed for its ability to induce the transcr.ption of e.ther 
transiently transfected or stably integrated target reporter gene constructs. For instance, a 
construct encoding a fus.on protein containing multiple copies of the modified sequence and 
a DNA bindmg oomain can be introduced into cells and the activity of the encoded fus.on 
protein measured in transcription assays (with a responsive reporter gene construct) and 
compared to analogous fusion proteins containing wild-type activat.on domain sequence or 

a different mutation of interest. 

The foregoing is illustrated in the case of the P 65 transcription activation domains 
Constructs are prepared encoding fusion proteins containing one or more P 65 transcription 
activation domains and a DNA binding domain. The p65 domains may be w.ld-type (as a 
control) or may contain any of a variety of alterat.ons in peptide sequence. These 
mutations can generally be mtroauced into a var.ety of p65denved transcnption act.vat.on 
domains. For example, Ml mutations can be introduced into plasmids carrying p65 
activation doma.n coding reg.ons between amino acids 533 and 550, or 361 and 550. or 
280 and 550. 

Exemplary mutations for p55 transcnption activation domains include those 
intended to increase the potency of the p65 activation domain (including the M1 , M2. 
M6.M7 and M8 mutations) and those intended to decrease the potency (generally slightly) 
of the activation domain. The P 65 activation domain contains four phenylalan.ne residues 
and mutations that convert these residues to alanine has been shown to significantly 
reduce the potency of the p65 activation domain in yeast and in vitro experiments. Our 
expenments show that changing F 533 and F 541 to alan.ne residues reouced tne potency 
of P 65 activation ooma.n to half of wild type level. Mutations of tne M4 and M5 class 
change the conserveo senne and proline residues between am.no-acios 361 and 450. Our 
cata show that M4 and M5 mutant sequences can induce the expression of target genes 
synergistics when fused to other acidic type activation domains. In GST pull down 
assays the region of the M4 and M5 mutations interacts with TFIIA. Although M4 and M5 
mutations ind.v.dually have a very small effect on the ability of p65 activat.on ooma.n to 
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.nduce the reporter aene, combined together, they significantly reduce its potency. Thus, 
mutations tor the practitioner to bear in mind include, but are not limited to, the following: 

WT: 532-DPSS1ADMDFSALLSQIS 

Mt: 532-DFSDFADMDFDADLSQIS 

WT: 439-ALLQLQFDDED 

M2: 439-ALLDLDFDDED 

WT: 529-GDEDFSSIADMDFSALLSQI 

M3: 529-GDEDASS1ADMDASALLSQI 

WT: 377-SALALPAPPQVL 

M4: 377-GALALGAGGQVL 

WT: 401-SALAQAPAPVP 

M5: 401-GALAQAGAGVG 

WT: 434-GTLSEALLQLQFD 

M6: 434-GDFS-ALLQLQFD 

WT: 472-SEFQQLLNQ 

M7: 472-SEFSALLNQ 

WT: 472-SEFQQLLNQ 

M8: 472-SDFQQLLNQ 

WT: 530-DEDFSSlADMDFS 

M9: 530-DEDFSSLLDMDFS 

Synergizing Domains 

A synerg.zing domain is any domain which observably increases the potency of 
transcription activation when recruited to the promoter along with the transcnpt>on activation 
domain A synergizing domain can be an independent transcription acfvat.cn doma.n or an 
activation unit which on its own does not induce transcnption but is able to potentiate the 
activity of a transcription activation domain with which it is linked covalently (i.e.. within the 
same fusion protein) or with wh.ch it is associated non-covalently (e.g., through bundling or 
ligand-mediated clustering). 

One example of a synerg.zing doma.n is the so-called "alanma'prot.ne nch" or "AP~ 
act,vat,on motif of P 65. which extends from about am.no ac.ds 361 to about am.no acid 450 
of that protein. Simi.ar AP act.vat.cn motifs are also present in. e.g., the P 53 and CTF 
proteins. The presence of cne or several ccp.es ot an AP doma.n alone .n a proteir coes 
not itself provide the abilrty to .nduce activator-dependent transcription activation. 
However, when linked to activation units which are themselves capable of inducing some 
level of activator-oependent transcnpt.on, e.g., another portion of p65 or VP16. the AP 
activation unit synergizes with the second activat.on doma.n to induce an .ncrease in the 
level of transcription. 
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Accordingly, the invention provides an AP activation unit, functional derivative 
thereof, or other synergizing domain which on its own is incapable cf activating 
transcription. Functional alternative sequences for use as synergizing aomams, including 
among otners derivatives of an AP activation unit, can be obtained, for instance, by 
5 screening candidate sequences for binding to TFIIA and measuring transcriptional activity in 
a co-transfection assay. Such equivalents are expected to include forms of the activatton 
unit which are truncated at either the N-terminus or C-terminus or both, e.g., fragments of 
P 65 (or homologous sequences thereto) which are about 75, 60, 50, 30 or even 20 ammo 
acid residues in length (e.g., ranging in length from 20-89 ammo acids). Likewise, it is 
ic expected that the AP activation unit sequence from p65 can tolerate amino acid 

substitutions, e.g., to produce AP motifs of at least 95%, 90%, 80% and even 70% 
identity with the AP activation unit sequence of SEQ ID No. 2 of USSN 08/918,401 . 
These and other AP derivatives include, for example, AP domains based on natura'ly- 
occurnng sequence but modified by the replacement, insertion or deletion of 1 , 2, 3, 4 or 5 

15 ammo acid residues. 

Other synergizing domains are independent activation domains, e.g. VP16. While 
VP16 can activate transcription on its own, it can synergize with p65 to produce levels of 
transcription that are greater than the sum of the transcription levels effected by each 
activation domain alone. As shown in the examples, fusion of VP1 6 to a nucleic acid 

2C containing an FRB domain, a lac reoressor tetramenzation domain and p65 greatly 

increases tne level of expression of a target gene as compared to the same construct in the 
absence of VP16. 

Synergizing domains may also be fused to an unbundled or bundled DNA binding 
domain. To avoid the activatton of transcription in a constitutive manner with constructs 
25 such as these, it is preferable tnat the synergizing domain itself be incapable of activating 
transcription. 



Ugand binding domains 

Fusion proteins containing a ligand binding comain for use in practicing this 
30 invention can function through one of a variety of molecular mechanisms. 

In certain embodiments, the ligand binding domain perm.ts ligand-mediated cross- 
linking of the fusion protein molecules beanng appropriate ligand binding aomams. in these 
cases, the ligand is at least divalent and functions as a dimenzing agent by binding to the 
two fusion proteins and forming a cross-linKed hetercdimenc complex wh.cn activates target 
gene expression. See e.g. WO 94/18317, WO 96/20951, WO 96/06097, WO 97/31898 
and WO 96/41865 

In other embodiments, the binding of ligand to fusion protein is thought to result in 
an allosteric change in the protein leading to the binding of the fusion protein to a target 
DNA seauence [see e.g. US 5.654,168 and 5,650.298 (tet systems), and WO 93/23431 
and WO 98/18925 (RU486-based systems)] or to another protein which binds to the 
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target DNA sequence [see e.g. WO 96/37609 and WO 97/381 17 (ecdysone/RXR-based 
systems)], in either case, modulating target gene expression. 

Dimenzation-based systems 

In the cross-linking-based dimenzation systems the fusion prote.ns can contain one 
or more Iigand binding domains fin some cases containing two, three or four such domains) 
and can further contain one or more additional domains, heterologous with respect to the 
l.gand binding domain, mclud.ng e.g. a DNA binding domain, transcription activation domain, 
etc 

In general, any l.gand/ligand binding domain pair may be used in such systems. For 
example. Iigand binding domains may be derived from an immunophiim such as an FKBP. 
cyclophilm, FRB domain, hormone receptor protein, antibody, etc., so long as a Iigand is 
known or can be identified for the Iigand binding domain. 

For the most pan. the receptor domains will be at least about 50 ammo acids, and 
fewer than about 350 am.no acids, usually fewer than 200 amino acids, either as the 
natural domain or truncated active portion thereof. Preferably the binding domain will be 
small (<25 kDa, to allow efficient transtection in viral vectors), monomelic, nonimmunogenic, 
and should have synthetically accessible, cell permeant. nontoxic ligands as described 
above. 

Preferably the Iigand binding domain is for (i.e.. binds to) a Iigand which is not itself 
a gene product (i.e., is not a protein), has a molecular weight of less than about 5 kD and 
preferably less than about 2.5 kD, and is cell permeant. In many cases it will be preferred 
that the Iigand does not have an intrinsic pharmacologic activity or toxicity which interferes 
with its use as a transcription regulator. 

The DNA seouence encoding the Iigand binding domain can be subjected to 
mutagenesis for a variety of reasons. The mutagen.zed Iigand binding domain can provide 
for higher binding affinity, allow for discrimination by a Iigand between the mutant and 
naturally occurnng forms of the Iigand binding domain, provide opportunities to design 
ligand-ligand binding domain pairs, or the like. The change in the Iigand binding domain can 
involve directed changes in amino acids known to be involved in Iigand binding or with 
ligand-dependent conformational changes. Alternatively, one may employ random 
mutagenesis using combinatorial techniques. In either event, the mutant Iigand binding 
domain can be expressed in an appropriate prokaryotic or eukarotic host and then 
screened for desired licand binding or conformational properties. Examples involving FKBP. 
35 cyclophilin and FRB domains are disclosed in detail in WO 94/18317, WO 96/06097. WO 
97/31898 and WO 96/41865. For instance, one can change Phe36 to Ala and/or Asp37 to 
Gty or Ala in FKBP 12 to accommodate a substituent at positions 9 or 10 of the Iigand 
FK506 or FK520 or analogs, mimics, dirners or other derivatives thereof. In particular, mutant 
FKBP12 domains which contain Val. Ala. Gly. Met or other small am.no acids in place of one 
40 or more of Tyr26. Phe36. Asp37. Tyr82 and Phe99 are of particular interest as receptor 
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domains for FK506-type and FK-520-type iigands containing modifications at C9 and/or 
C10 and the.r synthet.c counterparts (see e.g.. WO 97/31898). Illustrative mutations of 
current interest in FKBP domains also include the following: 



F36A 


! Y26V 


i F46A 


; W59A 


F36V 


Y26S 


| F48H 




H87W 


F36M 


D37A 


i F48L 




H87R 


F36S 


I90A 


F48A 




F36V/F99A 


F99A 


I9* A 


E54A/F36V/F99G 




F99G 


F46H 


E54K/F36M/F99A 


, Y26A 




F46L 


V55A 


F36NVF99G 
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Table V Entries identity the native amino acid by single letter code and sequence position, 
Slowed by the replacement ammo acd in the mutant. Thus. F36V designates a human 
FKBP12 sequence in wh.cn ohenylalanine at position 36 is replaced by vahne 
F36V/F99A indicates a double mutation in which phenylalanine at positions 36 and S9 are 
replacedby valine ana alanine, respectively. 



Illustrative examples of domains which bind to the FKBP:rapamycin complex 
("FRBs") are those which incluae an approximately 89-amino acid sequence containing 
residues 2025-21 13 of human FRAP. Another FRAP-denved sequence of interest 
comprises a 93 ammo acid seauence consisting of amino acids 2024-21 13. Similar 
considerations apply to the generation of mutant FRAP-derived domains which bind 
preferentially to FKBP complexes with raoamycin analogs (rapalogs) containing 
modifications (i.e.. are bumped') relative to rapamycin in the FRAP-binding portion of the 
drug. For example, one may obtain preferential binding using rapalogs bearing 
substituents other than -OMe at the C7 position with FRBs based on the human FRAP 
FRB peptide sequence but bearing amino acid substitutions for one of more of the residues 
Tyr2038. Phe203S. Thr2098, Gln2099, Trp2101 and Asp2102. Exemplary mutations 
include Y2038H. Y2038L, Y2038V. Y2038A, F2039H, F2039L. F2039A, F2039V, D2102A, 
T2098A. T2098N, T2098L, and T209BS. Rapalogs bearing substituents other than -OH at 
C28 and'or substituents other than =0 at C30 may be used to obtain preferential binding 
to FRAP proteins bearing an amino acid substitution for Glu2032. Exemplary mutations 
include E2032A and E2032S. Proteins compnsing an FRB containing one or more amino 
acid replacements at the foregoing positions. Iibranes of prote.ns or peptides randomized at 
3C those positions (i.e., containing various substituted amino acids at those residues), 

libraries randomizing the entire protein domain, or combinations of these sets c: mutants are 
made using the procedures oescnbed above to identify mutant FRAPs that bind 
preferentially to bumped rapalogs. 
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Other macrolide b.nding domains useful in the present invention, including mutants 
thereof, are described in the art. See. for example. W096/41865, W096/13613. 
WO96/06111. WO96/06110, WO96/06097, W096/12796, WO95/05389. WO95/02684. 
W094/18317. 

The ability to employ in vitro mutagenesis or combinatorial modifications of 
sequences encccmc proteins allows for the production of librar.es of proteins whicn can be 
screened for binding affinity for different ligands. For example, one can random.ze a 
sequence of 1 to 5. 5 to 10, or 10 or more codons. at one or more sites in a DNA sequence 
encoding a binding protein, make an expression construct and introduce the expression 
construct into a unicellular microorganism, and develop a library of modified sequences. 
One can then screen tne library for b.nding affinity of the encoded polypeptides to one or 
more ligands. The best affinity sequences which are compatible with the cells into which 
they would be introduced can then be used as the ligand bind.ng domain for a given ligand. 
The ligand may be evaluated with the desired host cells to determine the level of binding of 
the ligand to endogenous proteins. A binding profile may be detemined for each such 
ligand which compares ligand binding affinity for the modified ligand binding domain to the 
affinity for endogenous proteins. Those ligands which have the best binding profile could 
then be used as the ligand. Phage display techniques, as a non-limiting example, can be 

used in carrying out the foregoing. 

In other embodiments, antibody subunits. e.g. heavy or light chain, particularly 
fragments, more particularly all or part of the variable region, or single chain antibodies, can 
be used as the ligand b.nding domain. Antibodies can be prepared against haptens which 
are pharmaceutical^ acceptable and the individual antibody subunits screened for binding 
affinity. cDNA encoding the antibody subunits can be isolated and modified by deletion of 
the constant region, portions of the variable region, mutagenesis of the vanable region, or 
the like, to obtain a bind.ng protein domain that has the appropriate affinity for the ligand. 
In this way, almost any physiologically acceptable hapten can be employed as the ligand. 
Instead of antibooy units, natural receptors can be employed, especially where the 
binding domain is known. In some embodiments of the invention, a fusion protein 
composes more than one ligand binding domain. For example, a DNA binding domain can 
be linked to 2, 3 or 4 or more ligand binding domains. The presence of multiple ligand 
binding domains means that l.gand-mediated cross-linking can recruit multiple fusion 
proteins containing transcription activation domains to the DNA binding domain-containing 
fusion protein. 

Allostery-based systems 

As mentioned previously, systems for transcnption regulation based on ligand- 
dependent allostenc changes in a chimenc transcnption factor are also useful in practicing 
the subject invention. One such system employs a deletion mutant of the human 
progesterone receptor which no longer binds progesterone or other endogenous steroids 
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but can be activated by the orally active progesterone antagonist RU4B6, described, e.g., 
in Wang et al. (1 994) Proc. Natl. Acaa. Sci. U.S.A. 91 :81 80. Activation was demonstrated 
in cells transplanted .nto mice using doses of RU486 (5-50 g/kg) considerably below the 
usual dose for induc.ng abortion in humans (10 mg/kg). However, the reported induction 
ratio in culture and in animals was rather low. 

Another such system is the ecdysone inoucible system. Early work oemonstrated 
mat fusing the Drosophila steroid ecdysone (Ec) receptor (EcR) Ec- binding domain to 
heterologous DNA b.ndmg and activation domains, such as E. coli lexA and herpesvirus 
VP1 6 permits ecdysone-dependent activation of target genes downstream of appropriate, 
binding sites (Christopnerson et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 89:6314). An 
improved ecdysone regulation system has been developed, using the DNA b.nding 
domain of the EcR itself. In this system, the regulating transcnption factor is provided as 
two proteins: (1) a truncated, mutant EcR fused to herpes VP16 and (2) the mammalian 
homolog (RXR) of Ultraspiracle protein (USP), which heterodimerizes with the EcR (No et 
al. (1996) Proc. Natl. Acad. Sc.. U.S.A. 93:3346). In this system, because the DNA 
binding domain was also recognized by a human receptor (the human farnesoid X 
receptor), it was altered to a site recognized only by the mutant EcR. Thus, the invention 
provides an ecdysone inducible system, in which a truncated mutant EcR is fused to at 
least one subunit of a transcription activator of the invention. The transcription factor further 
comprises USP, thereby providing high level induction of transcription of a target 
genehaving the EcR target sequence, dependent on the presence of ecdysone. 

In another approach, the inducible system comprises or is derived from the 
E. coli tet repressor (TetR). which binds to tet operator (tetO) sequences upstream of 
target genes. In the presence of tetracycline, or a tetracycline analog which bind to tetR. 
DNA binding is abolished and thus transaction is abolished. This system, in which the 
TetR had previously been linked to transcription activation domains, e.g. from VP16, is 
generally referred to as an allosteric -off-switch" described by Gossen and Bujard (Proc. 
Natl. Acad. Sci. U.S.A. (1992) 89:5547) and in U.S. Patents 5,464.758; 5,650.298; and 
5,589.362 by Bujard et al. Target gene expression is reportedly reguiatable over several 
orders of magnitude in a reversible manner. This system is said to provide low 
background and relatively high target gene expression in the absence of tetracycline or an 
analog. The invention described herein provides a method for obtaining even stronger 
transcription induction of a target gene, which is reguiatable by the tetracycline system or 
other inducible DNA binding domain. 

In some emDod.ments. a "reverse" Tet system is used, again baseo on a DNA 
binding domain that is a mutant of the E. coli TetR. but wtuch binds to TetO in the 
oresence of Tet. Additional information on mutated tetR-based systems is prov.dec above 
and in patent documents cited prev.ously. The use of bundling as descnoec herein 
provides a method for obtaining even stronger transcnption induction of a target gene in the 
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presence of tetracycline or an analog thereof from a very low background in the absence of 
tetracycline. 

A tetR domain useful in the practice of this invention may comDr.se a naturally 
cccurnng pept.de sequence of a tetR of any of the various classes (e.g. class A. B. C. D 
a- E) (in which case the absence of the Hgand stimulates target gene transcnpt.on). or more 
preferably, compnses a mutated tetR which is derived from a naturally occurring seouence 
from which it differs by at least one amino acid substitution, addit.on or deletion. Of 
particular interest are those mutated tetR domains in which the presence of the Hgand 
stimulates binding to the TetO sequence, usually to induce target gene transcription m a 
cell engineered in accordance with this invention. For example, mutated tetR domains 
mclude mutated Tn10-denved tetR domains having an amino acid substitution at one or 
more of amino acid oositions 71. 95, 101 and 102. By way of further illustration, one 
mutated tetR compnses ammo acids 1 - 207 of the TM0 tetR in which glutamic ac.d 71 ,s 
changed to lysine, aspart.c acid 95 is cnanged to asparagine, leucine 101 is changed to 
senne and glycine 102 is changed to aspart.c add. Ligands include tetracycline and a wide- 
vanety of analogs and mimics of tetracycline, including for example, anhydrotetracyclme 
and doxycyclme. Target gene constructs in these embodiments contain a target gene 
operably linked to an expression control sequence including one or more copies of a DNA 
sequence recognized by the tetR of interest, includ.ng for example, an upstream activator 
sequence for the appropriate tet operator. See e.g. US Patent No. 5,654,168. 

Ligands of the invention 

In various embodiments where a ligand binding domain for the ligand is 
endogenous to the cells to be engineered, it is often desirable to alter the peptide 
sequence of the ligand binding domain and to use a ligand which discnminates between the 
endogenous and engineered ligand binding domains. Such a ligand should bind 
preferentially to the engineered ligand binding domain relative to a naturally occurring 
peptide sequence, e.g.. from wh.ch the modified domain was denved. This aoproach can 
avoid untoward intrinsic activities of the ligand. Significant guidance and illustrative 
examples toward that end are provided in the various references cited herein. 

Cross-linking/dimenzation systems 

Any ligand for which a binding protein or ligand binding domain is known or can be 
identified may be used in combination with such a ligand binding domain in carrying out this 
invention. 

Extensive guidance and examples are provided in WO 94/1 8317 for ligands and 
other components useful for cross-linked oligomenzation-based systems. Systems based 
on ligands for an immunophilin such as FKBP, a cyclophilin. and'or FRB domain, are of 
special interest Illustrative examples of ligand binding domain/l.gand pairs that may be 
used for cross-linking include, but are not limited to: FKBP/FK1012 . FKBP/ synthetic 
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d.vatent FKBP ligands (see WO 96/06097 and WO 97/31898). FRB/rapamycin or analogs 
thereof:FKBP (see e.g., WO 93/33052, WO 96/41865 and Rivera et al. "A humanized 
system for pharmacologic control of gene expression", Nature Medicine 2<9):1 028-1 032 
(1997)) cydophilirvcyciosporin (see e.g. WO 94/18317). FKBP/FKCsA/cyclcphil.n (see 
e g Belshaw et al 1996, PNAS 93:4604-4607), DHFR/methotrexate (see e.g. L.c.tra et al. 
1996, Proc. Natl. Acad. Sci. USA 93:12817-12821), and DNA gvrase/coumermyc.n (see 
eg Farraretal 1996. Nature 383:178-181). Numerous variations and modifications to 
ligands and ligand binding domains, as well as methodologies for designing, selecting 
and/or characterizing them, wh.ch may be adapted to the present invention are disclosed ,n 
the cnec references. 

Allostery-based systems 

For additional ouioance on ligands for other systems which may be adapted to this 
invention, see e.g. (Gosser, and Bujard Proc. Natl. Acad. Sci. U.S.A. 1992 89:5547. and 
US Patent Nos. 5654168. 5650298. 5589362 and 5464758 (TetRAetracycl.ne). Wang et al, 
1994, Proc. Natl. Acad. Sc.. USA 91:81 80-81 84 (progesterone receptor/RU486), and No et 
al, 1996. Proc. Natl. Acad. Sc:. USA 93:3346-3351 (ecdysone receptor/ecdysone). 



DNA-binding domains 

Regulated express.on systems relevant to this invention involve the use of a 
protein containing a DNA bind.ng domain to selectively target a desired gene for 
expression (or repression). Systems based on l.gand-mediatea cross-linking generally rely 
upon a fusion prote:n contain.ng the DNA b.nding domain together with one or more ligand 
b.nding domains. One general advantage of such systems is that they are part.cularly 
modular in nature and lend themselves to a wide variety of des,gn choices. These systems 
permit wide latituoenn the choice of DNA binding domains. Many allostery-based 
systems, like the TetR- and progesterone-R-based systems, use a fusion prote.n 
containing a DNA binding domain together with a transcription regulatory doma.n (e.g. a 
transcription activation or repression domain). Some allostery-based systems such as the 
ecdysone-regulated system, use a protein like RXR which contains a DNA binding domain 
together with a bind.ng site for another protein (such as the ecoysone receptor). Of the 
allostery-based systems, the progesterone receptor-based system and like systems 
permit relatively greater latitude .n the choice of DNA binding domain. While allcstery-oased 
systems like the TetR- and ecdysone receptor tyoe may be eng.neered at the DNA 
binding domain, they are somewhat less amenable to ready replacement ot the DNA 
binding domain. 

Vanous DNA binding domains may oe incorporated into the design of fusion 
proteins of this invention, especially those of the ligand-mediated cross-linking type and 
the progesterone-R-based type, so long as a corresponding DNA -recognition" sequence 
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is known, or can be identified, to which the domain is capable of binding. One or more 
copies of the recognition sequence are incorporated .nto, or present with.n. the expression 
control sequence of the target gene construct. Peptide sequence of human ong.n is often 
preferred where available, for uses in human gene therapy. Composite DNA binamg 
domains provide one means for achieving novel sequence specificity for the protem-DNA 
b.ndmg interaction. An illustrative composite DNA binding domain contam.ng component 
peptide sequences of human origin is ZFHD-1 which is described in detail below. 
Indivraual DNA-bind,ng domains may be further modified by mutagenes.s to Decrease, 
mcrease, or change the recognition specificity of DNA binding. These modifications can be 
achieved by rational des.cn of substitutes in positions known to contnbute to DNA 
recognition (often baseo on homology to related prote.ns for wh:ch explicit structural data 
are available). 

For example in the case of a homeodomam, substitutions can be made .n ammo 
acds in the N-terminal arm, first loop, second helix, and third helix known to contact DNA. In 
zinc fingers, subnotions can be made at selected positions in the DNA recognition hel.x: 
Alternatively, random methods, such as selection from a phage display library can be used 
to identify altered domains with increased affinity or altered specificity. 

For additional examples, information and guidance on designing, mutating, select.ng, 
combining and characterizing DNA binding domains. se€, e.g.. Pomerantz JL, Wolfe SA, 
Pabo CO, Structure-based design of a dimeric zinc finger protein Biochemistry 1 998 Jan 
27 37(4):965-970; Kim J-S and Pabo CO, Getting a Handhold on DNA: Design of Poly- 
Zinc Finger Proteins with Femtomolar Dissociation Constants, PNAS USA, 1998 Mar 
17-Q5(6)-2812-2817; Kim JS, Pabo CO, Transcriptional repression by zinc finger 
peptides. Exploring the potential for applications in gene therapy. . J Biol Chem 1997 Nov 
21;272(47):29795-29800; Greisman HA. Pabo CO . A general strategy for selecting h.gh- 
affi'nity zinc finger prote.ns for diverse DNA target sites. Science 1997 Jan 
31 275(5300) :657-661 ; Rebar EJ, Greisman HA, Pabo CO, Phage display methods for 
selecting zinc finger proteins with novel DNA-b,nd.ng specificities, Methods Enzymol 
1996 267 129-149; Pomerantz JL. Pabo CO, Sharp PA , Analys.s of homeodomain 
function by structure-based design of a transition factor, Proc Natf Acad Sc. U S A 1995 
Oct 10;92(21 ):9752-9756; Rebar EJ, Pabo CO, Zinc finger phage: affinity selection of 
fingers' with new DNA-b.nd.ng specificit.es, Science 1994. Feb 4;263:671 -673; Choo Y, 
Sanches-Garcia I, Klug A, In v.vo repression by a site-specific DNA-b.nd.ng prote.n 
designed against an oncogenic sequence. Nature 1994, Dec 15:372:642-645; Choo Y. 
Klug A Toward a code for the interaction of zinc fingers w.th DNA: Selection of randomized 
fingers' displayed on phage. PNAS USA, Nov 1994; 91:11163-1 1 167; Wu H. Yang W-P, 
Barbas CF III, Building zinc fingers by selection: toward a therapeutic acpHcat.cn. PNAS 
USA January 1995. 92:344.348; Jam.eson AC, K.m S-H. Wells JA, In Vrtro selection of 
zinc fingers with altered DNA-bind.ng specificity, B.ochem.stry 1994. 33:5689-5695. 
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international patent applications WO 96/20951 . WO 94/18317, WO 96/06166 and WO 
95/19431 ; and USSN 60/084819. 

Additional domains and linkers 

Additional domains may be included in the fusion proteins of this invention. 
For example, the fusion proteins may contain a nuclear localization sequence (NLS) 
which provides for the protein to be translocated to the nucleus. A NLS can be located at 
the N-terminus or the C-termmus of a fusion protein, or can be located between component 
portions of the fusion protein, so long as the function of fusion prote.n and its components 
is disrupted by presence of the NLS. Typically a nuclear localization sequence has a 
plurality of basic amino acios. referred to as a bipartite basic repeat (reviewed in Garcia- 
Bustos et al. (1991 ) Biochimica et Biophysica Acta 1071:83-101). One illustrative NLS is 
derived from the NLS of the SV40 large T antigen which is comprised of amino acids 
proline-lysine-lysine-lysine-arginme-lysine-val.ne (Kalderon et al. (1984) Cell 39:499-509). 
Another illustrative NLS is derived from a p53 protein. Wild-type p53 conta.ns three C- 
termmal nuclear localization signals, compnsing residues 316-325, 369-375 and 379-384 of 
P 53 (Shaulsky et al. (1990) Mol. Cell. Biol. 10:6565-6577). Other NLSs are described by 
Shaulsky et al (1990) supra and Shaulsky et al. (1991) Oncogene 6:2056. 

To facilitate their detection and/or purification, the fusion proteins may contain 
peptide portions such as "histidine tags", a glutathicne-S-transferase domain or an 
-epitope tag" which can be recognized by an antibody. 

The intervening distance and relative onentation of the various component domains 
of the fusion proteins can be varied to optimize their production or performance. The design 
of the fusion proteins may include one or more linkers", compnsing peptide sequence 
(which may be naturally-occurring or not) separating individual component polypeptide 
sequences. Many examples of linker sequences, their occurrence in nature, their des.gn 
and their use in fusion proteins are known. See e.g. Huston et al. (1988) PNAS 85:4879; 
U.S. Patent No. 5,091 ,513; and Richardson et al. (1988) Science 240:1 648-1 652. 

3C Target gene constructs 

A target gene construct comprises a gene of mteresl operably linked to an 
express,on control sequence which permits ligand-regulated expression of the gene. More 
specifically, such a construct typically compnses: (1 ) one or more copies of a DNA 
sequence recognized by a DNA binding domain of a fusion protein of tne invention lor by 
25 a DNA binding protein like RXR which binds to a fusion protein of the invention); (2) a 
promoter sequence consisting minimally of a TATA box and initiator sequence but 
optionally including other transcnption factor binding sites; (31 sequence encoding the 
desired product, including sequences that promote the initiation and termination of 
translation, if appropriate; (4) an optional sequence consisting of a splice donor, splice 
acceptor, and intervening .ntron DNA; and (5) a sequence directing cleavage and 
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polyadenylation of the resulting BNA transcript. Typically the construct contains a copy of 
the target gene to be expressed, operably linked to an expression control sequence 
comprising a minimal promoter and one or more copies of a DMA recognition sequence 
responsive to the transcription factor. 

(a) Target genes 

A wide variety of genes can be employed as the target gene, including genes that 
encode a therapeutic protein, antisense sequence or nbozyme of interest, or any other 
protein which is of therapeutic or scientific interest. The target gene (and there may be . 
multiple target genes) can encode a gene product which provides a desired pnenotype. If 
can encode a membrane-DOund or membrane- spanning protein, a secreted protein, or a 
cytoplasmic protein. The proteins which are expressed, singly or in combination, can 
involve homing, cytotoxicity, proliferation, differentiation, immune response, inflammatory 
response, clotting, tnromboiysis, hormonal regulation, angiogenesis, etc. The polypeptide 
encoded by the target gene may be of naturally occurring or non-naturalty occurrnng 
peptide sequence. 

Various secreted products include hormones, such as insulin, human growth 
hormone, glucagon, pituitary releasing factor, ACTH. melanotropin, relaxin, leptin.efc; 
growth factors, such as EGF. IGF-1 . TGF-alpha. -beta, PDGF. G-CSF, M-CSF, GM-CSF, 
FGF. erythropoietin, thrombcpoietin, megakaryocyte growth factors, nerve growth factors, 
etc.; proteins which stimulate or inhibit angiogenesis such as angiostatin, endostatin and 
VEGF and variants thereof; interieukins. such as IL-1 to -15; TNF-alpha and -beta; and 
enzymes and other factors, such as tissue plasminogen activator, members of the 
complement cascade, perfonns, superoxide dismutase; coagulation-related factors such as 
2= antithrombin-lll. Factor V. Factor VII, Factor Vlllc, vWF, Factor IX, alpha-anti-trypsin. 

protein C, and protein S, endorphins, dynorphin. bone morphogenetic protein, CFTR, efc. 

The gene can encode a naturally-occurring surface membrane protein or a protein 
made so by introduction of an appropriate signal peptide and transmembrane sequence. 
Various such proteins include homing receptors, e.g. L-selectin (Mel-14). hematopoietic cell 
30 markers, e.g. CD3, CD4, CD8, B cell receptor, TCR subunits alpha, beta, gamma or delta, 
CD10. CD19, CD28, CD33, CD38, CD41 , efc. receptors, such as the interleukin 
receptors 1L-2R, IL-4R, etc.; receptors for other ligands including the various hormones, 
growth factors, etc., receptor antagonists for such receptors and soluble forms of such 
receptors; channel proteins, for influx or efflux of ions. e.g. H+. Ca+2, K+, Na~, CI", efc, 
35 and the like; CFTR, tyrosine activation motif, zap-70, etc. 

Proteins may be modified for transport to a vesicle for exocytosis. By adding the 
sequence from a protein which is directed to vesicles, where the sequence is modified 
proximal to one or the other terminus, or situated in an analogous position to the protein 
source, the modified protein will be directed to the Golgi apparatus for packaging in a 
40 vesicle. This process in con]unction with the presence of the chimeric proteins for 
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exocytcsis allows for rapid transfer of the proteins to the extracellular medium and a 
reiativelv hiah localized concentration. 

The'target gene product can be an .ntracellular protein such as a protein involved in 
a metabolic pathway, or a regulatory protein, steroid receptor, transcnpt.cn factor, etc.. 

By way of further illustration, in T-cells, one may wish to introduce genes encoding 
one or botn chains of a T-cell receptor. For B-cells, one could provide the heavy and light 
chains for an immunoglobulin for secretion. For cutaneous cells, e.g. keratmocytes. 
particular* keratinocyte stem cells , one could provide for protection against ,nfect>on, by 
secreting alpha, beta or gamma interferon, antichemotactic factors, proteases specific for 
10 bacterial cell wall proteins, various anti-viral proteins.efc. 

In various situations, one may w.sh to direct a cell to a particular srte. The srte can 
mcluae anatomical sites, such as lymph nodes, mucosal tissue, sk.n, synov.um. lung or 
other .ntemal organs or funct.onal sites, such as clots, injured sites, sites of surg.cal 
manipulation, inflammation, infection, etc. Regulated expression of a membrane protein 
,* which recogn.zes or binds to the particular site of interest, for example, provides a method 
for directing the engineered cells to that srte. Thus one can achieve a localized 
concentration of a secreted product or effect cell-based healing, scavenging, protection from, 
infection anti-tumor activity, etc. Proteins of interest include homing receptors, e.g. L- 
selectin GMP140. CLAM-1 . etc.. or addressins, e.g. ELAM-1 . PNAd. LNAd. etc. clot 
20 binding proteins, or eel! surface proteins that respond to localized gradients of chemotactic 

factors. . 

In one embodiment, recognition elements for a DNA binding domain of one of the 
subject fusion prote.ns are introduced into the host cells such that they are operatrvely 
linked to an endogenous target gene, e.g. by homologous recombination with genomic 
2 c DNA A variety of suitable approaches s are available. See, e.g.. PCT publications 
WO9a'09222. W095/31 560. WOS6/2941 1 , WO95/31560 and WO94/12650. This 
permits ligand-mediated regulation of the transenption of the endogenous gene. 
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(b) Minimal Promoters. 

Minimal promoters which may be incorporated into a target gene construct (or other 
construct of the invention) may be selected from a w,de variety of known sequences, 
including promoter regions from fos, hCMV, SV40 and IL-2. among many others. Illustrative 
examples are providec wh.ch use a minimal CMV promoter or a minimal IL2 gene promoter 
(-72 to +45 with respect to the start site, Siebenlist et al.. MCB 6:3042-3049. 1986) 

(c) DNA recognition sequences. 

The choice of recognition sequences to use in the targe: gene construct is in some 
cases determined by the nature of the regulatory system to be employed. 
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Where the target gene construct comprises an endogenous gene with its own 
regulatory DNA, the recognition sequence is thereby provided by the cells, and the 
practitioner prov.des a DNA binding domain which recognizes it. 

In systems relying on a tetR or RXR-type DNA binding domain, the recognition 
seauence is again ususally predetermined (by the choice of tetR or RXR-type DNA 
binding domain). 

In other cases e.g.. in ligand-mediated crosslinking systems and systems like tne 
progesterone receptor-based system, a diverse set of DNA binding doma.mrecognit.on 
sequence choices are available to the practitioner. 

Recognition sequences for a wide vanety of DNA-bindmg doma.ns are known. 
DNA recognit.on sequences for other DNA bind.ng domains may be determined 
experimentally. In the case of a comoosrte DNA binding domain, DNA recognition 
sequences can be determined experimentally, or the proteins can be manipulated to direct 
their specif.city toward a desired sequence. A desirable nucleic acid recognition sequence 
for a composite DNA bind.ng domain consists of a nucleotide sequence spanning at least 
ten, preferably eleven, more preferably twelve or more, and even more preferably in some 
cases eighteen bases. The component binding portions (putative or demonstrated) within 
the nucleotide sequence need not be fully contiguous; they may be interspersed with 
"spacer" base pairs that need not be directly contacted by the chimenc protein but rather 
impose proper spacing between the nucleic acid subsites recognized by each module. 
These sequences should not impart expression to linked genes when introduced into cells 
in the absence of the engineered DNA-bindmg protein. 

To identify a nucleotide sequence that is recognized by a chimeric protein 
containing a DNA-b.ndmg region, preferably recognized with high affinity (dissociation 
constant 10" 1 1 M or lower are especially preferred), several methods can be used. If high- 
aff.nity binding sites for individual subdomains of a composite DNA-bindmg region are 
already known, tnen these sequences can be joined with various spacing and onentation 
and the optimum configuration determined expenmentally (see below for methods for 
determining affinities). Alternatively, high-affinity binding sites for the protein or protein 
complex can be selected from a large pool of random DNA sequences by adaptation of 
published methods (Pollock. R. and Treisman, R.. 1990. A sensitive method for the 
determination of proteirv-DNA binding specificities. Nucl. Aads Res. 18. 6197-6204). Bound 
seouences are cloned into a plasmid and their precise sequence and affinity for the 
proteins are determined. From this collection of sequences, individual sequences with 
cesirable charactenst.es (i.e.. maximal affinity for composite protein, minimal affinrty for 
individual subdomains^ are selected for use. Alternatively, the collection of seouences is 
used to derive a consensus sequence that carries the favored base pairs at eacn position 
Such a consensus sequence is synthesized and tested to confirm that it has an 
appropriate level of affinity and specificity. 
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The target gene constructs may contain multiple copies of a DNA recognition 
sequence. For .nstance. the constructs may contain 5, 8. 10 or 12 recogn,t.on sequences for 
GAL4 or for ZFHD1 . 

Design and assembly of the DNA constructs 

Constructs may be designed in accordance with the principles, illustrative 
examples and materials and methods disclosed in the patent documents ana scientific 
literature cited herein, with modifications and further exemplification as aescribed. 
Components of the constructs can be prepared in conventional ways, where the coding 
sequences and regulator,- regions may be isolated, as appropr.ate, ligated, coned in an 
appropriate cloning host, analyzed by restriction or sequencing, or other convenient 
means. Particularly, using PCR, individual fragments including all or portions of a functional 
unit may be isolated, where one or more mutations may be introduced using "pnmer repair", 
ligation, in vitro mutagenesis, etc. as appropnate. In the case of DNA constructs encod.ng 
fusion proteins, DNA sequences encoding individual domains and sub-domains are joined 
such that they constitute a s.ng.e open reading frame encoding a fus.on protein capable of 
being translated in cells or cell lysates into a single polypeptide haroonng all component 
domains. The DNA construct encoding the fusion protein may then be placed into a vector 
for transducing host cells and permitting the expression of the protein. For biochemical 
analysis of the encoded chimera, it may be desirable to construct plasmids that direct the 
expression of the protein in bacteria or in reticulocyte-lysate systems. For use in the 
production of proteins in mammalian cells, the protein-encoding sequence is introduced into 
an expression vector that directs expression in these cells. Expression vectors suitable for 
such uses are well known in the art. Various sorts of such vectors are commercially 
25 available. 

Introduction of Constructs into Cells 

This invention is particularly useful for the engineering of animal cells and in 
applications involving the use of such engineered animal cells. The animal cells may be 
insect, worm or mammalian cells. While vanous mammalian cells may be used, including, by 
way of example, equine, bovine, ovine, canine, feline, murine, and non-human primate 
cells human and mouse cells are of particular interest. Across the various species, vanous 
types of cells may be used, such as Hematopoietic, neural, glial, mesenchymal, cutaneous, 
mucosal stromal, muscle (including smooth muscle cells), spleen, reticuloendothelial, 
epithelial, endothelial, hepatic, kidney, gastrointestinal, pulmonary, fibroblast, and other cell 
types. Ol particular interest are muscle cells (induding skeletal, cardiac and other muscle 
cells), cells of the central and penpheral nervous systems, and hematopoietic cells, wn.cr, 
may include any of the nucleated cells which may be involved with the erythroid, 
lymphoid or myelomonocytic lineages, as well as myoblasts and fibroblasts. Also of 
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interest are stem and-progenitor cells, such as hematopoietic, neural, stromal, musde, 
hepatic, pulmonary, gastrointestinal and mesenchymal stem cells 

The cells may be autologous cells, syngeneic cells, allogeneic cells and even in 
some cases, xenogeneic cells with respect to an intended host organism. The cells may 
be modified by changing the major histocompatibility complex CMHC) profile, by 
,nact.vat,ng (^-microglobulin to prevent the formation of functional Class I MHC molecules, 
mactivaticn of Class II molecules, providing for expression of one or more MHC molecules, 
enhancing or mactivating cytotoxic capabilities by enhancing or inhibiting the express.on of 
genes associated with the cytotoxic activity, and the like. 
1C " In some instances soecific clones or oligoclonal cells may be of .merest, where the 
cells have a particular specif.city, such as T cells and B cells having a specific antigen 
specificity or homing target site specificity. 

Constructs encoding the fusion proteins and comprising target genes of this 
invention can be introduced into the cells as one or more nucleic acid molecules or 
1 5 constructs, in many cases in association wrth one or more markers to allow for selection of 
host cells which contain the construct(s). The constructs can be prepared in conventional 
ways, where the cod.ng sequences and regulatory regions may be isolated, as 
appropnate, ligated, cloned in an appropriate doning host, analyzed by restriction or 
sequencing, or other convenient means. Particularly, using PCR, individual fragments 
20 including all or portions of a functional domain may be isolated, where one or more 

mutations may be introduced using "primer repair*, ligation, in vitro mutagenesis, efc. as 
appropriate. 

The construct(s) once completed and demonstrated to have the appropriate 
sequences may then be introduced into a host cell by any convenient means. The 
25 constructs may be incorporated into vectors capable of episomal replication (e.g. BPV or 
EBV vectors) or into vectors designed for integration into the host cells" chromosomes. The 
constructs may be integrated and packaged into non-replicat.ng, defedive viral genomes 
like Adenovirus, Adeno-associated virus (AAV), or Herpes simplex virus (HSV) or others, 
including retroviral vectors, for infection or transduction into cells. Alternatively, the 
30 construct may be introduced by protoplast fusion, electroporation, biolistics, calcium 

phosphate transfection, l.pofection, microinjection of DNA or the like. The host cells will in 
some cases be grown and expanded in culture before introduction of the construct(s), 
followed by the appropriate treatment for introduction of the construct(s) and integration of 
the construct(s). The cells will then be expanded and screened by virtue of a marxer 
35 present in the constructs. Vanous markers which may be used successfully indude hprt. 
neomycin resistance, thymidine kinase, hygromyan resistance, etc.. and various cell- 
surface markers such as Tac, CD8, CD3. Thyt and the NGF receptor. 

In some instances, one may have a target site for homologous recombination, 
where it is desired that a construct be integrated at a particular locus. For example, one can 
40 delete and/or replace an endogenous gene (at the same locus or elsewhere) wrth a 
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recombinant target construct of this invention. For homologous recomb.nat.on. one may 
generally use either Q or O-vectors. See. tor example. Thomas and Capecch,, Cel. (1987^ 
51, 503-512; Mansour, era/.. Nature (1988) 336, 348-352; and Joyner, eta/.. Nature 

(1989) 338. 153-156. 

The constructs may be introduced as a single DNA molecule encod.ng all of the 
genes or different DNA molecules having one or more genes. The constructs may be 
.ntroduced simultaneously or consecutively, each with the same or different marKers. 

Vectors containing useful elements such as bacterial or yeast ong.ns ot replication, 
selectable and/or amplifiable markers, promoter/enhancer elements for expression in 
prokaryotes or eukaryotes. and mammalian expression control elements, etc. wh.ch may 
be used to prepare stocks of construct DNAs and for carrying out transfections are well 
known in the art. and many are commercially available. 

Introduction of Constructs into Animals 

Any means for the introduction of genetically eng.neered cells or heterologous DNA 
into animals preferably mammals, human or non-human, may be adapted to the practice of 
this invention for the delivery ol the vanous DNA constructs into the intended reapient. For 
the purpose of this d.scussion, the vanous DNA constructs described here.n may together 
be referred to as the transgene. 

by ex vivo genetic engineering 

Cells which have been transduced ex vivo or in vitro with the DNA constructs may 
be grown in culture unoer selective conditions and cells which are selected as having the 
desired construd(s) may then be expanded and further analyzed, using, for example, the 
polymerase chain reaction for determining the presence of the construct in the host cells 
and/or assays for the products of the desired gene product(s). After being transduced 
with the heterologous oenet.c constructs, the modified host cells may be identified, 
selected, gorwon. characterized, etc. as des,red, and then may be used as plannec. e.g. 
grown in culture or introduced into a host organism. 

Depending upon the nature of the cells, the cells may be introduced into a host 
organ,sm e.g. a mammal, in a wide vanety of ways, generally by injection or implantation 
into the desired tissue or compartment, or a tissue or compartment permrtting migration of 
the cells to their intended destination. Illustrative sites lor injection or .molantaticn ,r,clude the 
vascular system, bone marrow, muscle, liver, cranium or sp.nal cord, peritoneum, and SKin. 
Hematopoietic cells, for example, may be administered by injection into the vascular 
system, there being usually at least about 10* cells and generally not more than about 
1010 cells. The number of cells which are employee will depend upon the circumstances, 
the purpose for the introduction, the lifetime of the cells, the protocol to be used, for 
example, the number of administrations, the ability ol the cells to multiply, the stability ol 
the therapeutic agent, the physiolog.c need for the theraoeut.c agent, and the like. 
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Generally, 1or myoblasts or fibroblasts lor example, the number of cells will be at least 
about 104 and not more than about 109 and may be applied as a dispersion, generally 
being injected at or near the site of interest. The cells will usually be in a physiologically- 
acceptable medium. 

Cells engineered in accordance with this invention may also be encapsulated, e.g. 
using conventional biocompatible matenals and methods, pnor to implantation into the host 
organism or patient for the production of a therapeutic protein. See e.g. Hguyen et al. 
Tissue Implant Systems and Methods for Sustaining viable High Cell Densities within a 
Host, US Patent No. 5,314,471 (Baxter International. Inc.); Uludag and Sefton. 1993, J 
Biomed. Mater. Res. 27(10):1213-24 (HepG2 cells/hydroxyethyl methacrytate-methyl 
methacrylate membranes); Chang et al, 1 993, Hum Gene Ther 4(4);433-40 (mouse Ltk- 
cells expressing hGH/immunoprotective perm-selective alginate microcapsules; Reddy et 
al, 1993, J Infect Dis 168(4):1 082-3 (alginate); Tai and Sun, 1993. FASEB J 7(1 1);1061-9 
(mouse fibroblasts expressing hGH/alg.nate-poly-L-lysine-alginate membrane); Ao et al, 
1995, Transplantation Proc. 27(6):3349, 3350 (alginate); Rajotte et al, 1995. 
Transplantation Proc. 27(6):3389 (alginate); Lakey et al, 1995, Transplantation Proc. 
27(6):3266 (alginate); Koroutt et al, 1995, Transplantation Proc. 27(6);3212 (alginate); 
Donan et al. US Patent No. 5,429.821 (alginate); Emerich et al. 1993, Exp Neurol 
122(1):37-47 (polymer-encapsulated PC12 cells); Sagen et al, 1993, J Neurosci 
a 1 3(6):241 5-23 (bovine chromaffin cells encapsulated in semipermeable polymer membrane 
and implanted into rat spinal subarachnoid space); Aebischer et al. 1994, Exp Neurol 
126(2):151-8 (polymer-encapsulated rat PC12 cells implanted into monkeys; see also 
Aebischer. WO 92/19595); Savelkoul et al, 1994, J Immunol Methods 170(2):1 85-96 
(encapsulated hybridomas producing antibodies; encapsulated transfected cell lines 
!5 expressing various cytokines); Winn et al, 1 994, PNAS USA 91 (6)2324-8 (engineered 
BHK cells expressing human nerve growth factor encapsulated in an immunoisolation 
polymeric device and transplanted into rats); Emerich et al, 1994, Prog 
Neuropsychopharmacol Biol Psychiatry 1 8(5);935-46 (polymer-encapsulated PC12 cells 
implanted into rats); Kordower et al. 1994. PNAS USA 91 (23):1 0898-902 (polymer- 
30 encapsulated engineered BHK cells expressing hNGF implanted into monkeys) and Butler 
et al WO 95/04521 (encapsulated Oevice). The cells may then be introduced in 
encapsulated form into an animal host, preferably a mammal and more preferably a human 
suDject in need thereof. Preferably tne encapsulating matenal is semipermeable, permitting 
release into the host of secreted proteins proouced by the encapsulated cells. In many 
35 embodiments the semipermeable encapsulation renders the encapsulated cells 

immunologically isolated from the host organism in which the encapsulated cells are 
introduced. In those embodiments the cells to be encapsulated may express one or more 
fusion proteins containing component domains derived from proteins of the host species 
and/or from viral proteins or proteins from species other than the host species. The cells 
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may be derived from one or more individuals other than the recipient and may be denved 
from a species other than that of the recipient organism or patient. 

by in vivo genetic engineering 

Instead of ex vivo modification of the cells, in many situations one may wish to 
modify cells in vivo. A vanety of techniques have been developed for genet.c engineering 
of target tissue and cells in vivo, including viral and non-viral systems. 

In one approach, the DNA constructs are delivered to cells by transfect.on, i.e., by 
delivery to cells of "naked DNA", lipid-complexed or liposome-formulated DNA. or otherwise 
formulated DNA. Prior to formulation of DNA, e.g., with lipid, or as in other approaches, pnor 
to incorporation in a final expression vector, a plasmid containing a transgene bearing the 
desired DNA constructs may first be experimentally optimized for express.on (e.g.. 
inclusion of an .ntron in the 5' untranslated region and elimination of unnecessary 
sequences (Feigner, et al., Ann NY Acad Sd 126-139. 1995). Formulation of DNA, e.g. 
with various lipid or liposome materials, may then be eflected using known methods and 
materials and delivered to the recipient mammal, See. e.g.. Canonico et al, Am J Respir 
Cell Mol Biol 1024-29. 1994 (in vivo transfer of an aerosolized recombinant human alpha 1- 
antitrypsin gene complexed to cationic liposomes to the lungs of rabbits); Tsan et al, Am J 
Physiol 268 (Lung Cell Mol Physiol 12): L1052-L1056. 1995 (transfer of genes to rat lungs 
via tracheal insufflation of plasmid DNA alone or complexed with cationic liposomes); Alton 
et al Nat Genet. 5:135-142, 1993 (gene transfer to mouse airways by nebulized delivery 
of cDNA-liposome complexes). In either case, delivery of vectors or naked or formulated 
DNA can be carried cut by instillation via bronchoscopy, after transfer of viral particles to 
Ringers, phosphate buffered saline, or other similar vehicle, or by nebulization. 

V.ral systems include those based on viruses such as adenovirus, adeno- 
associated virus, hybnd adeno-AAV, lentivirus and retroviruses, which allow for 
transduction by infection, and in some cases, integration of the virus or transgene into the 
host genome. See, for example, Dubensky et al. (1984) Proc. Natl. Acad. Sc. USA 81 , 
7529-7533- Kaneda et al.. (1989) Science 243,375-378; Hiebert et al. (1989) Proc. Natl. 
Acad Sci USA 86, 3594-3598; Hatzoglu et al. (1990) J. Biol. Chem. 265. 17285-17293 
and Ferry, et al. (1991) Proc. Natl. Acad. Sci. USA 88. 8377-8381 . The virus may be 
administered by injection (e.g. intravascularly or intramuscularly), inhalation, or other 
parenteral mode. Non-viral delivery methods such as administrate of the DNA via 
complexes with liposomes or by injection, catheter or biolist.cs may also be used See e.g. 
WO 96/41865 PCT/US97/22454 and USSN 60/084819. for example, for additional 
guidance on formulation and deliver/ of recombinant nucleic acids to cells and to organisms. 

By employing an attenuated or modified retrovirus carrying a target transcriptional 
initiation region, if desired, one can activate the virus using one of the sub,ect transcription 
factor constructs, so that the vims may be produced and transduce ad)acent cells 
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The use of reccmcinant viruses to deliver the nucleic ac:d constructs are of 
particular interest. The transgene(s) may be incorporated into any of a variety of viruses 

useful in gene therapy. 

in clinical settings, the gene delivery systems (i.e., the recombinant nucieic acos in 
vectors, virus, lipid formulation or other form) can be introduced into a patient, e.g., by any 
of a number of known methods. For instance, a pharmaceutical preparation of the gene 
delivery system can be introduced systemically. e.g. by intravenous injection, inhalation, 
etc. In some systems, the means of delivery prov.des for specific or selective transauction 
of tne construct into desired target cells. This can be achieved by regional or local 
administration (see U.S. Patent 5,328,470) or by stereotactic injection, e.g. Chen et a!.. 
(1994) PNAS USA 91 : 3054-3057 or by oeterminants of the delivery means. For instance, 
some viral systems have a tissue or cell-type specificity for infection. In some systems 
cell-typo or tissue-type expression is achieved by the use of cell-type or tissue-specific 
expression control elements controlling expression of the gene. 

Those references as well as tne references cited previously, including those 
relating to tetfi-based systems, progesterone-receptor-based systems and ecdysone- 
based systems, provide cetailed additional guidance on the preparation, formulation and 
delivery of various ligands to cells in vitro and to organisms. 

In preferred embodiments of the invention, the subject expression constructs are 
denved by incorporation of the genetic construct(s) of interest into viral delivery systems 
including a recombinant retrovirus, adenovirus, adeno-asscciated virus (AAV), hybrid 
adenovirus/AAV, herpes virus or lentivirus (although other applications may be earned out 
using recombinant bactena! or eukaryot.c plasmids). While various viral vectors may be 
used in the practice of this invention, AAV- and adenovirus-based approaches are of 
particular interest for the transfer of exogenous genes in vivo, particularly into humans and 
other mammals. The following additional guidance on the choice and use of viral vectors 
may be helpful to the practitioner, especially with respect to applications involving whole 
animals (including both human gene therapy and the development and use of animal mode! 
systems), whether ex vivo or in vivo. 

Viral Vectors: 
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Adenoviral vectors 

A viral gene delivery system useful in the present invent.cn utilizes aoenovirus- 
derived vectors. Knowledge of the genetic organization of adenovirus, a 36 kB, linear and 
oouble-stranded DNA vinjs. allows suostitution of a large p.ece of adenoviral DNA with 
foreign sequences uo to 8 kB. In contrast to retrovirus, the infection of adenoviral DNA into 
host cells does not result in chromosomal integration because adenoviral DNA can replicate 
in an episomal manner without potential genotox.crty. Also, adenoviruses are structurally 
stable and no genome rearrangement has been detected after extensive amplification. 



42 



WO 99*10510 PCT/CS98/1T723 

Adenovirus can m<ect virtually all epithelial cells regardless of their cell cycle stage. So tar, 
adenoviral infection appears to be linked only to mild disease such as acute respiratory 

disease in the human. 

Acenovirus is particularly suitable for use as a gene transfer vector because of its 
5 mid-sized genome, ease of manipulation, high titer, wide target-cell range, and high 

mfectivity. Both ends of the viral genome contain 100-200 base pair (bp) inverted terminal 
repeats (ITR), which are cis elements necessary for viral DNA rephcat.on and packaging. 
The early (E) and late (L) regions of the genome contain different transcription domains tnat 
are divided by the onset of viral DNA replication. The E1 region (E1 A and E1B) encodes 
, c proteins responsible for the regulation cf transcnotion of the viral genome and a few cellular 
genes The expression of tne E2 region (E2A and E2B) results in the synthesis of the 
proteins for viral DNA replication. These proteins are involved in DNA replication, late gene 
expression, and host cell shut ofi (Renan (1990) Radiotherap. Oncol. 19:197). The 
products of the late genes, including the majority of the viral caps.d proteins, are expressed 
is only after significant processing of a single pnmary transcnpt issued by the major late 
promoter (MLP). The MLP (located at 16.8 m.u.) is particularly efficient dunng the late 
pnase of infection, and all the mRNAs issued from this promoter possess a 5' tripartite 
leader (TL) sequence which makes them preferred mRNAs for translation. 

The genome of an adenovirus can be manipulated such that it encodes a gene 
2 c product of interest, but is inactivated in terms of its ability to replicate in a normal lytic viral 
life cycle (see, for example, Berkner et al., (1988) BioTechniques 6:616; Rosenleld et a!., 
(1991) Science 252:431-434; and Rosenfeld et al., (1992) Cell 68:143-155). Suitable 
adenoviral vectors derived from the adenovirus strain Ad type 5 d!324 or other strains of 
adenovirus (e.g.. Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art. 
25 Recombinant adenoviruses can be advantageous in certain circumstances in tnat they are 
not capable of infecting nond.vid.ng cells and can be used to infect a wide variety of cell 
types, including airway epithelium (Rosenfeld et al., (1992) cited supra), encotheiia! cells 
(Lemarchand et al.. (1992) PNAS USA 89:6482-6466), hepatocytes (Herz and Gerard, 
(1993) PNAS USA 90:2812-2816) and muscle cells (Quantin et a!., (1992) PNAS USA 
30 89:2581 -2584). Adenovirus vectors have also been used in vaccine development 
(Grunhaus and Horwitz (1992) Seminar in Virology 3:237; Graham and Prevec (1992) 
Biotechnology 20:363). Expenments in acministering recombinant adenovirus to different 
tissues mciude tracrtea instillation (Rosenfeld et al. (1991) ; Rosenfeld et al. (1992) Cell 
68-143) muscle imecaon (Ragot et al. (1993) Nature 361:647), penpneral intravenous 
25 injection (Herz and Gerard (1993) Proc. Natl. Acad. Sci. U.S.A. 90:2812), and stereotax 
inoculation into the b-ain (Le Gal La Salle et al. (1993) Science 254:988). 

Furthermore the virus particle is relatively stable and amenable to Dunf.cation and 
concentration, and as above, can oe moc.f.ed so as to affect the spectrum of infectiv.ty. 
Additionally, adenovirus is easy to grow and manipulate and exhibits broad host range ,n 
ao wrroand in vivo. This group of viruses can be obtained in high titers, e.g., 109 - 101 1 
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piaoue-forming unit (PFUVml. and they are highly infective. The life cycle of adenovirus 
coes not require imegrat.cn into the host cell genome. The foreign genes del.vered by 
adenovirus vectors are episomal, and therefore, have low genotoxicity to host cells. Nc 
side effects have been reported in studies of vaccination with wild-type adenovirus 
(Couch et al., 1963; Tod et al., 1 971 ). demonstrating their safety and therapeutic potential 
as in vivo oene transfer vectors Moreover, the carry.ng capacity of the adenov.ral 
genome for foreign DNA ,s large (up to 8 kilobases) relative to other gene del-very vectors 
(Berkner et al.. supra. Ha]-Ahmand and Graham (1986) J. Wo!. 57:267). Most repl.cation- 
dpfective adenoviral vectors currently ,n use and therefore favored by the present 
invention are deleted for al! cr pans of tne viral E1 and E3 genes but retain as much as 
80% of the adenoviral genetic material (see. e.g.. Jones et al., (1979) Cell 16:683; Berkner 
et al.. supra: and Graham et al.. in Methods in Molecular Biology, E.J. Murray, Ed. 
(Humana. Clifton, NJ. 1991 ) vol. 7. pp. 109-127). Expression of the inserted gene can be 
under control of, for example, the El A promoter, the major late promoter (MLP) and 
associated leader sequences, the viral E3 promoter, or exogenously added promoter 
sequences. 

Other than the requirement that the adenovirus vector be replication defective, or 
at least conditionally defective, the nature of the adenovirus vector is not believed to be 
crucial to the successful practice of the invention. The adenovirus may be of any of the 42 
different known serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the 
preferred starting matena! in order to obtain the conditional replication^efective adenovirus 
vector for use in the method of the present invention. This is because Adenovirus type 5 
is a human adenovirus about which a great deal of biochemical and genetic information .s 
known, and it has h.stoncally been used for most constructions employing adenovirus as 
a vector. As stated above, the typical vector according to the present invention is 
replication defective and will not have an adenovirus E1 region. Thus, it will be most 
convenient to introduce the nucle.c aad of interest at the position from which the E1 cocing 
sequences have been removed. However, the position of insertion of the nucleic acd of 
interest in a region with.n the adenovirus sequences ,s not cntical to the present invention. 
For example, the nucleic acid of interest may also be .nserted in lieu of the deleted E3 
region in E3 replacement vectors as described previously by Kansson et. al. (1986) or in 
the E4 region where a helper cell line or helper virus complements the E4 defect. 

A preferred helper eel! line is 293 (ATCC Accession No. CRL1573). Tnis helper 
cell line also termed a -packaging cell line" was developed by Frank Graham (Granam et 
a' (1987) J. Gen. Wo!. 35:59-72 and Graham (1977) J.General Virology 68:937-940) and 
provides E1 A and E1B in trans. However, helper cell lines may also De denved from 
human cells such as human emoryonic k.cney cells, muscle cells, hematopoietic ceils or 
ether human embryonic mesenchymal or epithelial cells. Alternatively, the helper cells may 
be denved from the ceils of other mammalian species that are pe-missive for human 
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adenovirus. Such cells include, e.g., Vero cells or other monkey embryon.c mesenchymal 
or epithelial cells. 

Various adenovirus vectors have been shown to be o1 use in the transfer ot genes 
to mammals, .ncludmg numans. Replication-deficient adenovirus vectors have been usee 
to express marker proteins and CFTR in the pulmonary epithelium. Because of their ab.l.ty 
to efficiently infect dividing cells, their tropism for the lung, and the relative ease of 
aenerauon of high titer stocks, adenov,ral vectors have been the subject of mucn research 
in the last few years, and various vectors have been used to deliver genes to the lungs of 
human subjects (Zabner et a!., Cell 75.207-216. 1993; Crystal, et al„ Nat Genet. 8:42-51 . 
1994; Boucher, et al., Hum Gene Ther 5:615-639. 1994). The first generation E1 a deleted 
adenovirus vectors have been improvec upon with a second generation that includes a 
temperature-sensitive E2a viral prote.n. designed to express less viral protein and thereby 
make the virally infected cell less of a target for the immune system (Goldman et al., Human 
Gene Therapy 6:839-851 ,1 995). More recently, a viral vector deleted of all viral open 
reading frames has been reported (F,sher et al.. Virology 217:1 1 -22, 1 996). Moreover, it 
has been snown that expression of viral 11-10 inhibits the immune response to aoenoviral 
antigen (Qin et al., Human Gene Therapy 8:1365-1374. 1997). 

Adenoviruses can also be cell type specific, i.e., infect only restricted types of cells 
and/or express a transoene only in restricted types of cells. For example, the viruses 
comorise a oene uncer the transcriptional control of a transcription initiation reg.cn 
specifically regulated by target host cells, as described e.g.. in U.S. Patent No. 5,698,443. 
by Henderson and Schuur. issued December 16, 1997. Thus, replication competenl 
adenoviruses can be restricted to certain cells by, e.g.. inserting a cell specific response 
element to regulate a synthesis ol a protein necessary for replication, e.g.. E1A or E1B. 

DNA sequences of a number of adenovirus types are available from Genbank. For 
example, human adenovirus type 5 has GenBank Accession No.M73260. The 
adenovirus DNA sequences may be obtained from any of the 42 human adenovirus types 
currently identified. Various adenovirus strains are available from the Amencan Type 
Culture Collection, Rockville. Maryland, or by request from a number of commercial and 
academ.c sources. A transgene as described herein may be incorporated into any 
adenoviral vector and delivery protocol, by the same methods (restnetion digest, linker 
ligation or filling in of ends, and ligation) used to insen the CFTR or other genes into the 
vectors. 

Adenovirus producer cell lines can include one or more cf the adenoviral genes E1 , 
E2a and E4 DNA sequence, for packaging adenovirus vectors in which one or more ot 
these genes have been mutated or deleted are cescrioed. e.g., in PCT/US95/15947 (WC 
98/18418) by Kadan et al.; PCT/US9S'07341 (WO 9S'346671) by Kovesdi et a!.. 
PCT/FR94/00624 (W094/281 52) by Imier et al.;PCT/FR94/00851 (WO 95/02697) by 
Perrocaudet et al.. PCT/US95/1 4793 (WO96/14061) by Wang et al. 
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AAV Vectors 

Another viral vector system useful for delivery of DNA is tne aoeno-associated 
virus (AAV). Adeno-associated virus is a naturally occurring oefective virus that requires 
another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient 
reolication and a productive life cycle. (For a review, see MuzyczKa et al., Curr. Topics in 
Micro, and Immunol. (1992) 158:97-129). 

AAV has net been associated with the cause of any disease. AAV is not a 
transforming or oncogenic virus. AAV integration into chromosomes of human cell lines does 
not cause any significant alteration in the growth properties or morphological charactenstics 
of the cells. These properties of AAV also recommend it as a potentially useful human gene 
tnerapy vector. 

AAV is also one of the few viruses that may integrate its DNA into non-dividing 
cells, e.g., pulmonary epithelial cells or muscle cells, and exhibits a high frequency ot stable 
integration (see for example Rone et al.. (1992) Am. J. Resp ( r. Cell. Mol. Biol. 7:349-356; 
Samulski et al., (1989) J. Virol. 63:3822-3828; and McLaughlin et al., (1989) J. Virol. 
62:1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged 
and can integrate. Space for exogenous DNA is limited to about 4.5 Kb An AAV vector 
such as that described in Tratschin et al.. (1985) Mol. Cell. Biol. 5:3251-3260 can be used 
to introduce DNA into cells. A variety of nucleic acids have been introduced into different 
cell types using AAV vectors (see for example Hermonat et al., (1984) PNAS USA 
81 -6466-6470: Tratsch.n et al., (1 985) Mol. Cell. Biol. 42072-2081 ; Wondisforo et al., 
(1988) Mol. Endocrinol. 2:32-39; Tratschin et al., (1984) J. Virol. 51 S1 1-619: and Flotte et 
al., (1993) J. Biol. Chem. 268:3781-3790). 

The AAV-based expression vector to be used typically includes the 145 nucleotide 
25 AAV inverted terminal repeats (ITRs) Hanking a restriction site that can be used for 
subcloning of the transgene. either directly using the restriction site available, or by 
excision of the transgene with restriction enzymes followed by blunting of the ends, ligation 
of appropnate DNA linkers, restriction digestion, and ligation into the site between the ITRs. 
The capacity of AAV vectors is about 4.4 kb. The following proteins have been 
expressed using various AAV-based vectors, and a variety of promoter/enhancers: 
neomycin phosphotransferase, chloramphenicol acetyl transferase. Fanconi s anemia gene, 
cystic fibrosis transmembrane conductance regulator, and granulocyte macrophage colony- 
stimulating factor (Kotin, R.M., Human Gene Therapy 5:793-801 , 1994, Table I). A 
transgene incorporating the vancus DNA constructs of this invention can similarly be 
included in an AAV-based vector. As an alternative to inclusion of a constitutive promoter 
such as CMV to drive expression of the recomoinant DNA encod.ng tne fusion protein(s). 
e.g. fusion proteins comprising an activation domain or DNA-bmomg domain, an AAV 
promoter can be used (ITR itself or AAV p5 (Flotte, et al. J. Biol.Chem. 268:3781-3790. 
1993)). 
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Such a vector can be packaged into AAV virions by reported methods. For 
example, a human cell line such as 293 can be co-transfected with the AAV-based 
expression vector anc another plasmid containing open reading frames encod.ng AAV rep 
and cap (which are obligatory for replication and packag.ng of the recombinant viral 
construct) under tne control of endogenous AAV promoters or a heterologous promoter. In 
the absence of helper virus, the rep proteins Rep68 and Rep78 prevent accumulate of 
the repl.cative form, but upon superinfection with adenov.rus or herpes v.rus, these 
proteins permit replication from the ITRs (present only in the construct containing the 
transgene) and expression of the viral caos.d proteins. This system results in packag.ng 
of the transgene DNA into AAV virions (Carter, B.J., Current Opinion in Biotechnology 
-V533--39 1992' Kotin R.M. Human Gene Therapy 5:793-801 , 1 994)). Typically, three 
days after 'transferor,, recombinant AAV is harvested from the cells along with adenovirus 
and the contammat.ng aoenovirus is tnen inactivated by heat treatment. 

Methods to imorove tne titer of AAV can also be used to express the transgene in 
an AAV virion Such strateo.es include, but are not limited to: stable expression of the 
ITR-flanked transgene in a eel! line followed by transfection with a second plasm.d to direct 
viral packaging; use of a cell line that expresses AAV proteins inducibly, such as 
temperature-sensitive .nducible express.on or pharmacolog.cally inducible expression. 
Alternatively a cell can be transformed with a first AAV vector includ.ng a 5' ITR, a 3' !TR 
flanking a heterolocous gene, and a second AAV veotor wh.ch includes an inducible or.gin 
of replication, e.g., SV40 origin cf replication, which is capable of being induced by an 
agent such as the SV40 T antigen and which includes DNA sequences encoding the AAV 
rep and cap proteins. Upon induction by an agent, the second AAV vector may replicate 
to a high copy number, and thereby increased numbers of infectious AAV partcles may be 
generated (see, e.g. U.S. Patent No. 5.693,531 by Chiorini et al., issued December 2. 
1 go? In yet another method for producing large amounts of recombinant AAV, a plasmid is 
used which incorporate the Epstein Barr Nuclear Antigen (EBNA) gene . the latent cng.n of 
replication of Epstein Barr virus (oriP) and an AAV genome. These plasmids are 
maintained as a multicopy extra^hromosomal elements in cells, such as in 293 cells. Upon 
addition of wild-type helper functions, these cells will produce high amounts of recombinant 
AAV (U S Patent 5.691.176 by Lebkowski et al., issued Nov. 25, 1997). In another 
system, an AAV packag.ng plasmid is prov.ded that allows expression of the rep gene, 
wherein the p5 promoter, which normally controls rep expression, is repiacea wrth a 
heterologous promoter (U.S. Patent 5.658,776, by Flotte et al., issued Aug. 19. 1997). 
Add,tionally, one may increase the efficiency of AAV transduction by treating the cells with 
an agent that facilitates the conversion of the s.ngle stranded form to the double stranded 
form as described in Wilson et al.. WO96/39530. 

AAV stocKS can be producec as described in Hermonat and MuzyczKa (1 984) 
PNAS 81 -6466, modified by using the p AAV/Ad descnbed by Samulski et al. (1989) J. 
Virol 63 3822. Concentration and purification of the virus can be ach.eved by reported 
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methods such as banding in cesium chlonde gradients, as was used lor the initial report cf 
AAV vector expression in vivo (Flotte, et al. J.Biol. Chem. 268:3781-3790, 1993) or 
chromatographic purification, as oescribed in O'Rioraan etal., WO97/08298. 

Methods for in vitro packaging AAV vectors are also available and have the 
advantage tnat tnere is no size limitation of the DNA packaged into the particles (see, U.S. 
Patent No. £.688.676, by Zhou et al.. issued Nov. 18. 1997). This procedure involves tne 
preparation o! eel! free packaging extracts. 

For addraona! detailed guicance on AAV technology which may be useful in the 
prac'jee of the subject invention, including methods and matenais for the incorporation of a 
transgene. the propagation and purification of the recombinant AAV vector ccntam.ng the 
transoene and its use in transfecting cells and mammals, see e.g. Carter et al. US Patent 
No. 4797,368 (10 Jan "989); Muzyczka et al, US Patent No. 5,139.941 (18 Aug 1992), 
Lebkowski et al. US Patent No, 5.173.414 (22 Dec 1992); Snvastava, US Patent No. 
5,252.479 (12 Oct 1993), LebKOwski et al, US Patent No. 5.354,678 (11 Oct 1994); ShenK 
et al. US Patent No. 5.436,146(25 July 1995); Chatterjee et al, US Patent No. 5.454,935 
(12 Dec 1995), Caner et al WO 93/24641 (published 9 Dec 1993), and Natsoulis, U.S. 
Patent No. 5,622,856 (April 22. 1997). Further information regarding AAVs and the 
adenovirus or herpes helper functions required can be found in the following articles.Bems 
and Bohensky (1987). "Adeno-Associated Viruses: An Update", Advanced in Virus 
Research, Academic 

Press, 33:243-306. The genome of AAV is described in Laughlin et al. (1983) "Cloning cf 
infectious adeno-associated virus genomes in bacterial plasmids", Gene, 23: 65-73. 
Expression of AAV is desenbed in Beaton et al. (1989) "Expression from the 
Adeno-associated virus P 5 and p19 promoters is negatively regulated in trans by the rep 
protein", J. Virol.. 63:44504454. Construction of rAAV is desenbed in a number of 
publications: Tratschm et al. (1984) "Adeno-associated virus vector for high frequency 
integration, expression and rescue of genes in mammalian cells", Moi. Cell. Biol., 
4:2072-2081; Hermonat and Muzyczka (1984) "Use of adeno-associated virus as a 
mammalian DNA donmg vector. Transduction of neomycin resistance into mammalian tissue 
culture cells'. Proc. Natl. Acad. Sc.. USA, 81 :6466-6470; McLaughlin et al. (1988) 
"Adeno-associated virus genera! transduction vectors: Analysis of Proviral Structures", J. 
Virol., 62:1963-1973; and Samulski et al. (1989) "Helper-free stocks of recombinant 
adeno-associated viruses: normal integration coeqooe viral gene exoress.on". J. Virol. , 
63:3822-3828. Cel: lines tnat can be transformed by rAAV are those described in 
Lebkowsk, et al. (1 988) "Adeno-associated virus: a vector system for efficient introduction 
and integration of DNA into a vanety cf mammalian cell types". Mo!. Cell. Biol., 
8 3988-3996. "Prccucer" or "packaging" cell lines used in manufacturing recombinant 
retroviruses are oescribed in Dougherty et al. (1989) J. V,rol., 63:3209-3212; and 
Markowitz etal. (1988) J. Virol.. 62:1120-1124. 
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Hybrid Adenovirus-AAV Vectors 

Hybnd Adenovirus-AAV vectors represented by an adenovirus caps.o contam.ng a 
nucleic acid compnsmg a portion of an aaenovirus. and 5' and 3' ITR sequences from an 
AAV wn.cn flank a selected transgene unoer tne control of a promoter. See e.g. Wilson et 

. a! international Patent Application Publication No. WO 96/13598. This hybno vector is 
characterized by higr. titer transgene delivery to a host cell and the ability to stably 
integrate tne transgene into the host cell chromosome in the presence of the rep gene. This 
v ,rus is capable of infecting virtually all cell types (conferred by its adenovirus sequences) 
and stable long term transgene integration into the host cell genome (conferred by its AAV - 

-,c sequences). 

The adenovirus nucie.c acid sequences employed in the this vector can range from 
a m,n.mum sequence amount, which requires the use of a helper v.rus to produce the 
hybrid virus particle, to only selected deletions of adenovirus genes, which deleted gene 
products can be supplied in the hybrid viral process by a packaging cell. For example, a 
15 hybrid virus can comprise the 5' and 3' inverted terminal repeat (ITR) sequences of an 
aaenovirus (wh.ch function as ongms of replication). The left terminal sequence (5 1 ) 
sequence of the Ad5 genome that can be used spans bp 1 to about 360 of the 
conventional adenovirus genome (also referred to as map units 0-1) and includes the 5' 
ITR and the packaging/enhancer domain. The 3' adenovirus sequences of the hybrid v.rus 
20 include the right terminal 3' ITR seauence which is about 580 nucleotides (about bp 
35,353- end of the adenovirus, referred to as about map units 98.4-100. 

The AAV sequences useful in the hybrid vector are viral sequences from which the 
rep and cap polypeptide encoding sequences are deleted and are usually the c.s acting 5' 
and 3' ITR sequences. Thus, tne AAV ITR sequences are flanked by the selected 
25 adenovirus sequences and the AAV ITR sequences themselves flank a selected 

transgene. The preparation of the hybrid vector is further described in detail in published 
PCT application entitled "Hybrid Aoenovirus-AAV V,rus and Method of Use Thereof, WO 
96/13598 by Wilson et al. 

For additional detailed guidance on adenovirus and hybrid adenovirus-AAV 
30 technology which may be useful in the practice of the subject invention, including methods 
and matenals for the incorporation of a transgene. the pro D agation and punfication of 
recombinant virus containing the transgene, and its use in transfecting celis ana mammals, 
see also Wilson et a;. WO 94/28936. WO 96/13597 and WO 95/26285. ana references 
cited therein. 

35 

Retroviruses 

The retroviruses are a group of single-stranded RNA viruses characterized by an 
ability to convert their RNA to double-stranded DNA in infected cells by a orocess o< 
reverse-transcription (Coffin (1990) Retrovindae and their Replication" In Fields, Kn.pe ed. 
40 Virology. New York: Raven Press). The resulting DNA then stably integrates into cellular 
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chromosomes as a proves and directs synthesis of viral prote.ns. The integrator. resuUs 
in the retent,on of tne v.ral gene seouences in the recipient cell and its descendants. The 
retroviral genome conta.ns three genes, gag, pol. and env that code for caps.da. prote.ns, 
polymerase enzyme, and envelope components, respect.ve.y. A sequence found 
upstream from the gag gene, termed psi . functions as a signal for packaging of genome 
into v.r.ons. Two lone term.na. repeat (LTR) sequences are present at the 5 and 3 ends 
cf the viral genome These contain strong promoter and enhancer sequences and are also 
reauired for integration in the host cell genome (Coffin (1990). supra). 

In oroer to construct a retroviral vector, a nucle,c acid of .merest is .nserted .nto the 
, v,ral genome ,n tne place of certa.n viral sequences to produce a v.rus that ,s 

repl.cat.on-defect.ve. In order to produce v.r.ons, a packaging cell line oonta.n.ng the gag, 
po' and env genes but without the LTR and psi components is constructed (Mann et al. 
, 1 983) Cell 33 153) When a recomo-.nant plasmid containing a human cDNA, together with 
the retroviral LTR and ps. sequences is .ntroduced into this eel, line (by calcium phosphate 
5 precip;tation for example), the ps, sequence allows the RNA transenpt of the recombinant 
" piasm.d to be packaged into wal particles, which are then secreted into the culture med.a 
(N,colas and Rubenste.n (1988) "Retrov.ral Vectors". In: Rodnguez and Denhardt ed. 
Vectors- A Survey of Molecular Clon,ng Vectors and their Uses. Stoneham:Butterworth; 
Temin (1 986) "Retrovirus Vectors for Gene Transfer: Efficient Integration .nto and 
2 n Expression of Exogenous DNA in Vertebrate Cell Genome". In: Kuchertapat, ed. Gene 
Transfer New York: Plenum Press: Mann et al.. 1 983. supra). The media conta.n.ng the 
recombinant retroviruses is then collected, optionally concentrated, and used for gene 
transfer. Retroviral vectors are able to infect a broad variety of cell types. However, 
integration and stable express.on requ.re the division of host cells (Pask.nd et al. (1 975) 

25 V,r0l ° 9y A 6 ^f prerequis|te for tne use of re trov,ruses is to ensure the safety of their use, 
part.cu.ariy with recard to tne possibility of the spread of wild-type v.rus in the cell 
populat.cn. The development of spec.al.zed cell lines (termed "packaging cells , wh.ch 
produce only replication-defective retroviruses has increased the utility of retroviruses or 
30 gene therapy, and defective retroviruses are well charactenzed for us* m gene transfer for 
aene therapy purposes (for a rev.ew see Miller. A.D. (1990) Blood 76.271). Thus, 
recombinant retrov.rus can be constructed in wh,ch part of the retroviral coding seouence 
(gag pol env) has been replaced by nuc.e,c acid encoding a fusion protein of the present 
invention, rendenng the retrov.rus replication defective. The repi.cation defective retrovirus 
3£ is then packaged ,nto v.nons wh.ch can be used to infect a target eel, through tne use of a 
helper virus by standard techniques. Protocols for produc.ng recomb.nant retrov.ruses ano 
for infecting cells in vitn, or in v.vo with such viruses can be found in Current Protons .n 
Molecular B.o.ogy. Ausubel. P.M. et a... (eds.) Greene Publ.sh.ng Assoc.ates. ( 989), 
Sections 9 10-914 and other standard laboratory manuals. Examples of suitable 
40 feC'ses Llude pU. pZ.P. pWE and pEM wh.ch are we,, known to those skil.ed ,n the 
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art A preferred retroviral vector is a pSR MSVtkNeo (Muller et al. (1991 ) Mol. Cell B.ol. 
1 1-1785 and pSR MSV(Xbal) (Sawyers et al. (1995) J. Exp. Med. 181 :307) and 
derivatives thereof. For example, the un.aue BamHI sites in both of these vectors can oe 
removed by digesting the vectors w.th BamHI, filling .n with Klenow and rel.gaUng to 
produce P SMTN2 and pSMTX2, respectively, as described in PCT/US96/09948 by 
Clackscn et al. Examoies of suitable packaging v.rus lines for prepanng both ecotropic 
and ampnotropic retroviral systems include Crip. Cre, 2 and Am. 

Retroviruses have been used to introduce a variety of genes into many d.tterent 
ce'l types including neural cells, epitnelial cells, endothelial cells, lymphocytes, myoclasts, 
nepatocytes, bone marrow ceils, m vitro and/or in v.vo (see for example Egl.tis et a!.. 
(1985) Science 230:1395-1398; Danos and Mulligan, (1988) PNAS USA 85:6460-6464; 
Wilson et al (1988) PNAS USA 85:3014-3018; Armentano et al.. (1990) PNAS USA 
876141-6145; Huber etal., (1991) PNAS USA 88:8039-8043; Ferry eta!., (1991) PNAS 
USA 88:8377-8381; Chowdhury et al.. (1991) Science 254:1802-1805; van Beusechem et 
a i (10Q 2 ) PNAS USA 69:7640-7644; Kay et al.. (1992) Human Gene Therapy 3:641-647. 
Dai etal (1992) PNAS USA 89:10892-10895; Hwu et al., (1993) J. Immunol. 150:4104- 
4115 U S Patent No. 4.868.1 16; U.S. Patent No. 4.980.286; PCT Application WO 
89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345; and PCT 

Application WO 92/07573). 

Furthermore, it has been shown that it .s poss.ole to limit the infection spectrum of 
retroviruses and consequently of retroviral-based vectors, by modifying the viral 
packaging proteins on the surface of the viral particle (see, for example PCT pubi.cat.cns 
W093/25234 WO94'06920, and W094/1 1524). For instance, strategies for the 
modification of the infection spectrum of retroviral vectors include: coupling antibooies 
specific for cell surface antigens to the viral env protein (Roux et al., (1989) PNAS USA 
86-9079-9083; Julan et al.. (1992) J. Gen Virol 73:3251-3255; and Goud et al.. (1983) 
Virology 1 63'251 -254); or coupling cell surface ligands to the viral env proteins (Neaa et 
al (1°91)J B.ol. Chem. 266:14143-14146). Coupling can be in the form of the chemical 
cross-linking with a protein or other variety (e.g. lactose to convert the env protein to an 
3C asialoglycoprotein). as well as by generating fusion proteins (e.g. smgle-cha.n 

ant.body/env fusion proteins). This techn.que, while useful to limit or otnerw.se direct tne 
infection to certain tissue tyoes. and can also be used to convert an ecotropic vector in to 
an ampnotropic vector. 



35 Other Viral Systems 

Other viral vector systems that may have application ,n gene therapy have oeen 
derived from herpes wus. e.g., Herpes S.mplex Virus (U.S. Patent No. 5.631 .236 by Woe 
et al issued May 20 1997). vaccinia virus (Ridgeway (1988) R.cgeway. •Mammalian 
expression vectors.' In: Rodnguez R L. Denhardt D T. ed. Vectors: A survey of molecular 
40 cloning vectors and their uses. Stoneham: Butterworth ,; Ba.ohwal and Sugden (1 986) 
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•Vectors for gene transfer derived from animal DNA viruses: Transient and stable 
expression of transferred genes," In: Kuchertapati R, ed. Gene transfer. New York: Plenum 
Press: Coupar et al. (1988) Gene, 68:1-10). and several RNA viruses. Preferred viruses 
mclude an alphavirus, a poxvirus, an arena virus, a vaccinia virus, a polio virus, and the 
like. In particular, herpes virus vectors may provide a unique strategy for persistence of 
the recombinant gene in cells of the central nervous system and ocular tissue (Pepose et 
al., (1994) Invest Ophthalmol Vis Sci 25:2662-2666). They offer several attractive 
features for various mammalian cells (Fnedmann (1989) Science, 244:1275-1281 : 
Ridgeway, 1988, supra: Baichwal and Sugden, 1986, supra; Coupar et al., 1988; Horwich 
et al.(1990) J.Virol., 64:642-650). 

With the recent recognition of defective hepatitis B viruses, new insight was gamed 
into the structure-1unct.cn relationship of different viral sequences. In vitro studies showed 
that the virus could reta.n tne aDility for helpeMJependent packaging and reverse 
transcription despite the deletion of up to 80% of its genome (Horw.ch et al., 1990, supra). 
Th.s suggested that large portions of tne genome could be replaced with foreign genetic ' 
material. The hepatotropism anc persistence (integration) were particularly attractive 
properties for liverO,rected gene transfer. Chang et al. recently introduced the 
chloramphenicol acetyltransf erase (CAT) gene into duck hepatitis B virus genome in the 
place of the polymerase, surface, and p^e-surface coding sequences. It was cotransfected 
with wild-type virus into an avian hepatoma cell line. Culture media containing high titers of 
the recomoinant virus were used to infect primary duckling hepatocytes. Stable CAT gene 
expression was detected for at least 24 days after transfection (Chang et al. (1991) 
Hepatology, 14:124A). 

Administration of Viral Vectors 

Generally the DNA or viral particles are transferred to a biologically compatible 
solution or pharmaceutically acceptable delivery vehicle, such as sterile saline, or other 
aqueous or non-aqueous isotonic sterile injection solutions or suspensions, numerous 
examples of which are well known in the art. including Ringer's, phosphate buffered saline, 
or other similar vehicles. Delivery of the transgene as naked DNA; as lipid-, liposome-, or 
otherwise formulated DNA; or as a recombinant viral vector is then preferably earned out 
via in vivo, lung-directed, gene therapy. Th.s can be accomplished by various means, 
including nebulizatioa'inhalation or by instillation via bronchoscopy. Recently, recombinan 
aoenovirus encoding CFTR was administered via aerosol to numan subjects in a phase I 
clinical trial. Vector DNA and CFTR expression were dearly detected in the nose and 
airway of these patients with no acute tox.c effects (Bellonet al., Human Gene Therapy. 
8(1):15-25, 1997). 

Preferably, the DNA or recombinant virus is administered insufficient amounts to 
transfect cells within the recipient s a.rways. including without limitation vanous airway 
epithelial cells, leukocytes residing within the a.rways and accessible airway smooth 
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muscle cells and provide sufficient levels of transgene expression to provide for 
observable llgand-resoonsive transcription of a target gene, preferably at a level providing 
therapeutic benefit without undue adverse effects. 

Optimal dosages of DNA or virus depends on a variety of factors, as discussed 
P rev 10 usly and may thus vary somewhat from patient to patient. Again, therapeutically 
effective doses of viruses are considered to be in the range of about 20 to about 50 ml of 
saline solution containing concentrations of from about 1 X 10? to about 1 X 10™ pfu of 
virus/ml, e.g. from 1 X 108 to 1 X 109 pfu of virus/ml. 

In a preferred embedment, the ratio of viral particle containing a target gene versus 
virai particles containing nucieic acids encoding the fusion proteins of the invention ,s about 
1 ■ 1 However other rat.os can aiso be used. For example, in certain instances it may be 
desirable to administer twice as many particles having the target gene as those encoding 
the fusion proteins. Other ratios include 1:3, 1:4. 1:10, 2:1 , 3:1 . 4:1 . 5:1 , 10:1. The optimal 
ratio can be determined by performing in vitro assays using the different ratios of viral 
particles to determine which ratio results in highest expression and lowest background 
expression of the target gene. Similarly, in situations in which the fusion proteins are 
encoded by two different nucleic acids each encapsidated separately, one can vary the 
ratio between the tr.ree viral particles, according to the result desired. 

2C Methods of the invention 

The invention provides methods for engmeenng cells to renoer them responsive to 
ligand-mediated regulation of exoression of a target gene. The cells may be engineered in 
vitro (ex vivo) or in vivo (i.e., in situ— within an organism). The target gene can be an 
endogenous gene or an exogenous gene (which may be of naturally occurnng pect.de 
25 sequence, or may contain non-naturally occurnng peptide sequence). The method 
composes introducing into tne cell(s) of interest one or more genetic constructs or 
compositions of this invention. Examples of these methods include the genetic engmeenng 
of cells or animals (e.g., mice, rats, etc.) as described herein for use, e.g., in the study of 
normal or pathologic biological processes (including vanous diseases), for the identification 
3G or characterization of genes or for the identification of new drugs or the evaluation of drug 

functioning, mechanism or efficacy. Other examples include the delivery of gene tnerapy to 
human subjects, whether in vivo or ex vivo. 

The invention a lS o provides methods for using such engineered cells, or organisms 
containing tnem. to carry out the oo,ect,ves mentioned above and elsewhere herein as well 
as in the cited references. These methods generally involve the application of l.gand to the 
engineered cells or organ.sm containing them in oroer to regulate the expression of a target 
gene. 
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KitS 

This invention further provides kits useful for the vanous applications. One such 
kit contains one or more nucieic acids, each encoding a fusion protein of the invention. The 
kit may further comprise an addttional nucleic acid comprising a target gene construct. 
Alternatively, the additional nucleic acid may contain a cloning site for insertion of a aesired 
target gene by the practitioner. The krt may further contain a sample of a ligand for 
regulating gene expression using these matenals. 

Uses 

In one application, cells engineered in accordance with the invention are used to 
produce a target protein in vitro. In sucn applications, the cells are cultured or otherw.se 
ma.nta.ned until production of the target protein is desired. At that time, the appropriate 
l.gand is added to the culture medium, in an amount sufficient to cause the desired level of 
target protein production. The protein so produced may be recovered from the medium or 
from the cells, and may be punfied from other components of the cells or medium as 
desired. 

Proteins for commercial and investigational purposes are often produced us.ng mammalian 
cell lines engineered to express the protein. The use of mammalian cells, rather than 
bacteria .nsect or yeast cells, is indicated where the proper function of the protein requires 
post-translational modifications not generally performed by non-mammal.an cells. Examples 
of proteins produced commercially this way include, among others, erythropoietin, BMP-2, 
t.ssue plasminogen activator. Factor Vlllic. Factor IX, and antibodies. The cost of 
producing proteins in this fashion is related to the level of expression achieved in the 
engineered cells. Thus, because the invention described herein can achieve considerably 
higher expression levels than conventional expression systems, it may reauce the cost of 
protein production. Toxicity of target prote.n production can represent a second limitation, 
preventing cells from growing to high density and/or reducing production levels. Therefore, 
the ability to tightly control protein expression, as described herein, permits cells to be 
grown to high density in the absence of protein production. Expression of the target gene 
can be activated and the prote.n product subsequently harvested, only after an optimum 
cell aensity is reached, or when otherwise desired. 

In other applications, cells within an animal host or human subject are engineered in 
acccrdance with the invention, cr cells so engneered are introduced into the anima' or 
human subject, in either case, to prepare the recipient for tigand-mediated regulation cf 
expression of a target gene. In the case of non-human an.mals, this can be done as part of 
veterinary treatment ot the animal or to create an animal model for a vanety cf research 
purposes. In the case of human sub,ects. this can be done as part cf a tneraaeutic or 
prophylactic treatment program. 
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This invention is applicable to a vanety of treatment approaches. For example, the 
target gene to be regulated can be an endogenous gene or a heterologous gene, ana its 
expression may be activated or repressed by addition of ligand. 

In some cases the target gene is a factor necessary for the proliferation and'or 
differentiation of one or more cell types of .merest. For example, it may be desirable to 
stimulate the expression of growth factors and lymphokines in a subject in wh.ch at least 
some of the blood cells have been destroyed, e.g., by radiotherapy or chemotherapy. For 
example, expression of erythropoietin stimulates the production of red blood cells, 
express.on of G-CSF stimulates the production of granulocytes, expressionof GM-CSF 
stimulates the produt.on of various white blood cells, etc. Similarly in diseases or 
conditions in which one or more specific cell types are destroyed by the disease process, 
e g . in autoimmune diseases, the soecif.c cells can be replenished by stimulating 
express.on of one or more genes encoding factors stimulating proliferat.on of these cells. 
The method of the invention can also be used to increase the number of lymphocytes in a 
subject having AIDS, such as by stimulating expression of lymphokines, e.g., IL-4. which 
stimulates proliferation of certain T helper (Th) cells. 

At least one advantage of increasing the production of an endogenous protein in a 
subject is the absence of an immune reaction against the protein, thus resulting in a more 
efficient treatment of the subject. In some cases of regulated express.on of a heterologous 
protein it may be preferable to simultaneously administer to the subject an 
immunosuppressant orug, e.g.. raoamycn, cyclosporin A, FK506 or a mixture of any of the 
foregoing or other compound which represses immune reactions. 

Cells which have been modified ex vivo with the DNA constructs of the invention 
may be grown in culture under selective conditions and cells which are selected as having 
the desired construct(s) may then be expanded and further analyzed, using, for example, 
the polymerase chain reaction for determm.ng the presence of the construct in the host cells 
and/or assays for the production of the des.red gene product(s). Once modified host cells 
have been identif.ed, they may then be used as planned, e.g. grown in culture or 
introduced into a host organism. 

In cases in which the target gene is an endogenous gene of the cells to be 
engineered the promoter and/or one or more other regions of the gene can be modified to 
include a target sequence that is specifically recognized by the DNA binding domain of a 
fusion protein of this invention so that the endogenous target gene is specifically 
recognized and regulated in a hgand-dependent manner. Such an embod.ment can be 
useful in situations in wh.ch no DNA b.nd.ng protein is known to specifically bind to a 
regulatory reg.on of the target gene. Thus, in one embodiment, one or more cells are 
obtained from a subject or other source and genetically engineered in vitro sucn tnat a 
des.red control element is inserted, operat.vely linked to the target gene. Tne cell can then 
be introduced into the subject. Alternatively, prior to introduction of the cell to the subject, 
the cell is further modified to .ndude a nuclek: aad encoding a fusion protein compnsmg a 
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DNA binding domain which is capable of interacting specifically with the expression control 
element introduced into the target gene. In other examples of the invention, an 
endogenous gene is modified in vivo by, e.g., homologous recombination, a technique well 
known in the an, and described, e.g., in Thomas and Capecchi (1 987) Cell 51 :503; 
Mansour et a!. (1988) Nature 336:348; and Joyner et a!. (1989) Nature 338:153. 

A target gene may encode antisense RNA or a ribozyme or other RNA molecule 
which is not translated. For example, the method of the invention can be used to inhibit 
production of one or more specific proteins in a cell of a subject. The availability of potent 
transcriptional activators provided by the invention will ensure that high levels of RNA. 
e.g., antisense RNA, are produced in a cell. 

Other uses for this invention induoe biological research. The two-hybrid assay is 
a transcription based assay first described by Fields and Song, Nature, 340:245-247 
(1989). See also, F.eids et ai, US Patent No. 5,283,173 (1 Feb 1994). The two-hybrid 
assay is based on the cbservation that transcnption factors contain separable functional 
modules that direct either DNA binding or transcription activation. A DNA binding domain 
expressed in cells will bind to DNA but not activate transcnption as it lacks a transcnption 
activation domain. Conversely, a transcription activation domain alone will not effect 
transcnption in the absence of directed and/or intimate interaction with DNA such as would 
be provided by a DNA-binding domain. However, if the DNA binding domain and the 
transcnption activation domains are eacn expressed as part of separate fusion proteins, 
and the fusion protems are capable of associating, the "two-hybrid" complex so fomnec 
represents a reconstituted transcnption factor (see FIG. 1). Such a reconstituted 
transcnption factor is capable of initiating transcription of a reporter gene (e.g., a gene for a 
conveniently detectaole marker such as beta-galactosidase or alkaline phosphatase 
(SEAP) or a protein important for cell viability) located downstream of DNA binding sites 
recognized by the DNA-binding domain. The amount of reporter gene expression, i.e.. the 
amount of gene product produced, will reflect the extent to which the fusion proteins 
complex with one another. As described in Example 8, use of the bundling domains of this 
invention to recruit additional activation domains to the complex significantly increases the 
sensitivity of the assay, such that interactions which were previously undetected are now 
clearly visible. 

This dramatic improvement has important ramifications for a vanety of applications 
of the 2-hybrid methocolcgy. including those aimed at identifying genes of interest, a: 
identifying peptide binding partners, and at identifying inhibitors of a protein-protein 
interaction of interest. 

For instance, to identify genes of interest, e.g. cDNAs from a cDNA library, the genes 
are cioned into a construct designee to express the encoded polypeptides as fusion 
proteins linked to a bundling domain and to a transcnption activation domain. As an 
example of the design of such constructs, one may start with a construct encoding a fusion 
protein such as an RLS fusion protein depicted in Fig 3, but replace the DNA sequence 
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encoding a ligand binding domain with a doning site for the insertion of the cDNAs. The 
constructs (beanng the cDNA inserts) are introduced into host cells containing (or 
subsequently made to contain) (i) a nucleic acid encoding a fusion protein containing a 
DNA binding ooma.n and a target domain of interest, and (ii) a reooner gene construct 
containing a recognition sequence for the DNA binding domain operabiy linked to a gene 
which encodes a detectable gene product or which is otherwise responsible for a 
detectable phenotype. Cells expressing a fusion protein containing a cDNA-encoded 
domain which binds to the target domain of interest express the reporter gene construct. 
The corresponding cDNA can thus be identified based on the fact that the protein it 
encodes binds to the target domain of interest. Potential advantages mcluae tne enhanced 
ability to aetect and identify less abundant cDNAs, cDNAs which are expressed at lower 
levels relative to other cDNAs, cDNAs encoding gene products which bind to the target 

with relatively lower affinity, etc. 

In another 2-hybrid application, a collection of polypeptides may be expressed as 
fusion proteins using nucie.c ac;d constructs encoding the desired collection of 
polypeptides in place of the cDNAs in the previous example. Peptide sequences which 
bind to a target protein or domain of interest may thus be identified. 

Another such application involves assays for identifying inhibitors of proteimprotein 
interactions of interest. In such assays a host cell is engineered to express two fusion 
proteins, the first containing a DNA binding domain and a first protein domain of interest, the 
second fusion protein contam.ng a transcnption activation domain, a bundling domain and a 
second protein domain of interest which binds to the first protein domain of interest. The 
ceils also contain a reporter gene construct as described above. Because the two fusion 
proteins bind to one another, the reporter gene is normally expressed. Such cells may De 
used to identify compounds which inhibit the protein :protein interaction, for instance in a 
drug screening program. Thus, cells containing fusion proteins of this invention may be 
contacted with one cr more compounds to be tested. The presence or amount of reporter 
gene product is then measured. A decrease in reporter expression in the presence of a 
substance, as compared to expression in the presence of less or none of the substance, 
indicates that the substance inhibited the protein:protein interaction. For additional details 
on the design and implementation cf such assays which can be adapted to this invention, 
see e.g. WO 95/24419. Substances for testing may be obtained from a wide variety of 
sources, including without limitation, microbial broths, cellular extracts, condrtionea media 
from cells, combinatorial libraries and other sources of naturaliy-occurnng or synthetic 
compounds. 
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Administration 

The ligand may be administered fo a human or non-human subject using 
pharmaceutical^ acceptable materials and methods of administration. Various 
formulations, routes of aaministration, dose and dosing scnedule may be used tor the 
aaministration of ligand. depending upon factors such as the binoing affinity of the hgand for 
the hgand binding domain, the choice of transcription regulatory domains, the condition and 
cirmcumstances of the recipient, the response desired, the biological half-life and 
bioavailability of the ligand, the biological half-life and specific activity of the target gene 
product, the number and location of engineered cells present, etc. The drug may be 
administered parenterally, or more preferably orally. Dosage and frequency of 
administration will depend upon factors such as described above. The drug may be taken 
orally as a pill, powaer, or dispersion; bucally; sublingually; injected intravascular^, 
intraperitoneal^, subcutaneously; or the like. The drug (and antagonists, as discussed 
below) may be formulated using conventional methods and materials well known in the art 
for the various routes of administration. The precise dose and particular method of 
administration will depend upon the above factors and be determined by the attending 
physician or healthcare provider. However, we show here that in the presence of bundled 
activation domains, the amount of drug needed to oligomenze the fusion proteins of this 
system is greatly reduced, by an order of magnitude or more. 

The particular dcsage of the drug for any application may be determined in 
accordance with conventional approaches and procedures for therapeutic dosage 
monitoring. A dose of the arug within a predetermined range is given and the patient's 
response is monitored so that the level of therapeutic response and the relationship of 
target gene expression level over time may be determined. Depending on the expression 
levels observed dunng the tjme penod and the therapeutic response, one may adjust the 
level of subsequent dosing to alter the resultant expression level over time or to otherwise 
improve the therapeutic response. This process may be iteratively repeated until the 
dosage is optimized for therapeutic response. Where the dnjg is to be administered 
chronically, once a majntenance dosage of the drug has been determined, one may conduct 
penodic follow-up monitoring to assure that the overall therapeutic response continues to 
be achieved. 

In the event that tne activation by the drug is to be reversed, administration of arug 
may be suspended so that cells return to a basal rate of proliferation. To effect a more 
active reversal of therapy, an antagonist of the drug may be acministered. An antagonist is 
a compound which binds to the drug or drug-b.nding domain to inhibit interaction of the drug 
with the fusion protein(s) and thus inhibit the downstream biological event. Antagonists 
include dnjg analogs, homologs or components which are monovalent with resoect to the 
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fusion proteins. Such compounos bind to the fusion prote.ns but oo not support clustering 
of the fusion proteins as .s required for activation of signaling. Thus, in the case of an 
aaverse reaction or the desire to terminate tne therapeutic effect, an antagonist can be 
administered in any convenient way, particularly intravascular^ or by 
inhalatiorv'nebulization, if a rapid reversal is desired. 

Compositions 

Drugs (i.e., the ligancs) for use in this invention can exist in free form or, where 
appropnate in salt form. Tne preparation of a wide vanety of pharmaceutical ly acceptable 
salts is well-known tc these of skill ,n the art. Pharmaceutical^ acceptable salts of venous 
compounos include tne conventional non-tox.c salts or the quaternary ammonium safts of 
such compounos when are formed, for example, from inorganic or organic acids of bases. 

The drugs mav form hydrates or solvates. It is known to those of skill in the art that 
charged compounds form hydrated species when lyophilized with water, or form solvated 
species when concentrated in a solution with an appropnate organic solvent. 

The drugs can aiso be administered as pharmaceutical compositions compnsmg a 
therapeutically (or proohylacticaliy) effective amount of the drug, and a pharmaceutical^ 
acceptable earner or excipient. Camers include e.g. saline, buffered saline, dextrose, water, 
glycerol ethanol, anc combinations thereof, and are discussed in greater detail below. The 
composition, if desired, can also contain minor amounts of wetting or emulsifying agents, or 
pH buffering agents. The composition can be a liquid solution, suspension, emulsion, 
tablet, pill, capsule, sustained release formulation, or powder. The composition can be 
formulated as a suppository, with traditional binders and earners such as triglycerides. Oral 
formulation can include standard earners such as pharmaceutical grades of manmtol. 
lactose, starch, magnesium stearate. sodium sacchanne, cellulose, magnes.um carbonate, 
etc. Formulation may involve mixing, granulating and compressing or dissolving the 
ingredients as appropriate to the desired preparation. 

The pharmaceutical earner employed may be, for example, either a solid or liquid. 
Illustrative solid carriers include lactose, tena alba, sucrose, talc, gelatin, agar, pectin, 
acacia magnesium stearate, stearic acid and the like. A solid earner can include one or more 
substances which may also act as flavonng agents, lubricants, solubilizers, suspending 
agents fillers, glidants. compression aids, binders or tablet-disintegrating agents; rt can 
also be an encapsulate material. In powders, the earner is a finely divided solid which is 
in admixture with the finely oivided active ingredient In tablets, tne aa.ve ingredient is 
mixed w.th a earner having the necessary compression properties in suitable proportions 
and compacted in the shape and size desired. The powders and tablets preferably contain 
up to 99% of the active .ncredient. Suitable solid camers .nclude. for examole, calcium 
phosphate, magnes.um stearate. talc, sugars, lactose, dextrin, starch, gelatin, cellulose, 
methyl cellulose, sodium carboxymethyl cellulose, polyvmylpyrrolid.ne. low melting waxes 
40 and ion exchange resins. 
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Illustrative liquid earners include syrup, peanut oil, olive oil, water, etc. Liquid 
carriers are used in preparing solutions, suspensions, emulsions, syrups, elixirs and 
pressurized compositions. The active ingredient can be dissolved or suspended in a 
pharmaceutical^ acceptable liquid carrier such as water, an organic solvent, a mixture of 

5 both or pharmaceutical^ acceptable oils or fats. The liquid earner can contain other suitable 
pharmaceutical additives such as soiubilizers, emulsifiers, buffers, preservatives, 
sweeteners, fiavonng agents, suspending agents, thickening agents, colors, viscosity 
regulators, stabilizers or osmo-regulators. Suitable examples of liquid earners for oral and 
parenteral administration include water (partially containing additives as above, e.g. 

io cellulose derivatives, preterably sodium carboxymethyi cellulose solution), alcohols 

(including monohyanc alcohols ana polyhydnc alcohols, e.g. glycols) and their derivatives, 
and oils (e.g. fractionated coconut oil and arachis oil). For parenteral administration, the 
carrier can also be an oily ester such as ethyl oleate and isopropyl mynstate. Sterile liquid 
carders are useful in stenle liquid form compositions for parenteral administration. The liquid 

15 carrier for pressunzed compositions can be halogenated hydrocarbon or other 

pharmaceutical^ acceptable propellant. Liquid pharmaceutical compositions which are 
sterile solutions or susoensions can be utilized by, for example, intramuscular, 
intraperitoneal or subcutaneous injection. Sterile solutions can also be administered 
intravenously. The drugs can also be administered orally either in liquid or solid composition 

20 form. 

The earner or excipient may include time delay matenal well known to the art, such 
as glyceryl monostearate or glyceryl distearate along or with a wax, ethyicellulose, 
hydroxypropylmethyicellulose, methylmethacrylate and the like. When formulated for oral 
administration, 0.01°- Tween 80 in PHOSAL PG-50 (phospholipid concentrate with 
25 1 ,2-propylene glycol, A. Nattermann & Cie. GmbH) may be used as an oral formulation for 
a variety of drugs for use in the practice of this invention. 

A wide variety of pharmaceutical forms can be employed. If a solid earner is used, 
the preparation can be tableted, placed in a hard gelatin capsule in powder or pellet form or 
in the form of a troche or lozenge. The amount of solid carrier will vary wiaely but 
30 preferably will be from about 25 mg to about 1 g. If a liquid earner is used, the preparation 
will be in the form of a syrup, emulsion, soft gelatin capsule, sterile injectable solution or 
suspension in an ampuie or vial or nonaqueous liquid suspension. 

To obtain a stable water soluble dosage form, a pharmaceutical^' acceptable salt o< 
the drug may be dissolved in an aqueous solution of an organic or inorganic acic, such as a 
35 0.3M solution of succinic acid or citnc acid. Alternatively, acidic derivatives can be dissolved 
in suitable basic solutions, if a soiuole salt form is not available, the compcunc is 
dissolved in a suitable cosolvent or combinations thereof Examples of such suitable 
dissolved in a suitable cosolvent or combinations thereof. Examples of such suitable 
cosolvents include, but are not limited to, alcohol, propylene glycol, polyethylene glycol 
40 300, polysorbate 80, glycerin, polyoxyethylated fatty acids, fatty alcohols or glycerin 
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hydroxy fatty acids esters and the like in concentrations ranging trom 0-60% of the total 
volume. 

Various delivery systems are known and can be used tc administer the drugs, or 
the various formulations thereof, including tablets, capsules, injectable solutions, 
encapsulation in liposomes, microparticles, microcapsules, etc. Preferred routes of 
administration to a patient are oral, sublingual and bucal. Methods of introduction aiso could 
include but are not limited to dermal, intradermal, intramuscular, intrapentoneal, intravenous, 
subcutaneous, intranasal, pulmonary, epidural, ocular and (as is usually preferred) oral 
routes. The drug may be administered by any convenient or otherwise appropnate route, 
c for example by infusion or bolus injection, by absorption through epitheiial or 

mucocutaneous linings (e.g., orat mucosa, rectal and intestinal mucosa, etc.) and may be 
administered together with other biologically active agents. Administration can be systemic 
or local. For ex vivo applications, the orug will be delivered as a liquid solution to the 
cellular composition. 

5 In a specific embodiment, the composition is formulated in accordance with routine 

procedures as a pharmaceutical composition adapted for intravenous administration to 
human beings. Typically, compositions for intravenous administration are solutions in 
sterile isotonic aqueous buffer. Where necessary, the composition may also include a 
solubiiizing agent and a local anesthetic to ease pain at the side cf the injection. Generally, 
>o the ingredients are supplied either separately or mixed together in unit dosage form, tor 
example, as a lyophilizec powder or water free concentrate in a hermetically sealed 
container such as an ampoule or sachette indicating the quantity of active agent. Where the 
composition is to be administered by infusion, it can be dispensed with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the composition is 
25 administered by injection, an ampoule of sterile water for injection or saline can be provided 
so that the ingredients may be mixed prior to administration. 

in addition, in certain instances, it is expected that the compound may be disposed 
within devices placed upon, in, or under the skin. Such devices include patches, implants, 
and injections which release the compound into the skin, by either passive or active 
30 release mechanisms. 

Materials and methods for producing the vanous formulations are well known in the 
art and may be adapted for practicing the subject invention. See e.g. US Patent Nos. 
5,182,293 and 4,837.31 1 (tablets, capsules and other oral formulations as well as 
intravenous formulations) and European Patent Application Publication Nos. 0 6-9 659 
35 (published April 26, 1995; rapamycin formulation for IV administration) and 0 648 494 
(published April 19, 1995; rapamycin formulation for oral administration). 

The effective dose of the drug will typically be in the range of about 0.01 to about 
50 mg/kgs, preferably about 0.1 to about 10 mg/kg of mammalian body weight, 
administered in single or multiple doses. Generally, the compound may be administered to 
40 patients in need of such treatment in a daily dose range of about 1 to about 2000 mg per 
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patient, in embodiments in which the compound is raoamyan or an analog thereof with 
some residual immunosuppressrve effects, it is preferred that the dose administered be 
below that associated with undue immunosuppressive eflects. 

The amount of a given drug which will be effective in the treatment or prevention of 
5 a particular disorder or condition will depend in pari on the seventy of tne disorder or 

condition, and can be determined by standard clinical techniques. In addition, in vitro or in 
vivo assays may optionally be employed to help identify optimal dosage ranges. 
Effective doses may be extrapolated from dose-response curves aerived from in vitro or 
animal model test systems. The precise dosage level should be determined by the 
ic attending physician or other health care provider and will depend upon well known factors, 
including route of administration, and the age, body weight, sex and general health of the 
individual; the nature, seventy and clinical stage of the disease; the use (or not) of 
concomitant therapies; and the nature and extent of genetic engineering of cells in the 
patient. 

15 The drugs can also be provided in a pharmaceutical pack or kit comprising one or' 

more containers fiilec with one or more of the ingredients of the pharmaceutical 
compositions. Optionally associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceutical or biological products, which notice reflects approval by the agency of 

20 manufacture, use or sale for human administration. 



The full contents of all references cited in this document, including references from 
25 the scientific literature, issued patents and published patent applications, are hereby 
expressly incorporated by reference. 

The following examples contain important additional information, exemplification and 
guidance which can be adapted to the practice of this invention in its vanous embodiments 
and the equivalents thereof. The examples are offered by way illustration should not be 
30 construed as limiting in any way. As noted throughout this document, the invention is 
broadly applicable and permits a wide range of design choices by the practitioner. 

The practice of this invention will employ, unless otherwise indicated, conventional 
techniques of eel! biology, cell culture, molecular biology, transgenic biology, microbiology, 
recombinant DNA, immunology, virology, pharmacology, chemistry, and pharmaceutical 
35 formulation and administration which are within the skill of the art. Such techniques are 

explained fully in the literature. See, for example, Molecular Cloning A Laboratory Manual, 
2nd Ed., ed. by Samorook. Fritsch and Maniatis (Cold Spnng Harbor Laboratory Press: 
1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis 
(M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; Nucleic Acid Hybnd.zaton 
40 (B. D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & 
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S.J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc.. 1987); 
immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To 
Molecular Cloning (1 984); me treatise, Methods In Enzymoiogy (Academic Press, Inc., 
N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eas., 
1987, Cold Spnng Haroor Laboratory); Methods In Enzymoiogy, Vols. 154 and 155 (Wu 
et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, 
eas.. Academic Press. London, 1987); Handbook Of Experimental Immunology, Volumes \- 
IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 
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Example 1: Construction ot plasmids encoding bundled activation domains: 

Transcripts factor fusion prote.ns were expressed from pCGNN (Attar. R.M. & G.lman. 
M Z (1992) Expression cloning of a novel zinc-finger prote.n that binds to the c-fos serum 
response element. Mo!. Cell. Biol. 12, 2432-2443). Inserts cloned into pCGNN as Xbal- 
BamHI fragments are transcribed under control of the human CMV enhancer and promoter 
and are expressed with an amino-term.nal epitope tag (a 16-amino acid portion of the 
Haemophilus influenzae hemagglutinin gene) and nuclear local.zat.on sequence from the 
SV40 large T antigen. Individual components of the transcription factors were syntnesized 
by polymerase chain reaction as fragments containing an Xbal site .mmediately upstream 
of the first codon and a Spel site, an ,n-frame stop codon, and a BamHI srte .mmeoiately 
oownstream of the last coaon. Fusion proteins comprising multiple component were 
assemoled by stepw.se insertion of Xbal-BamHI fragments into Spel/BamHI-opened 
vectors. The individual components used and their abbrev.at.ons are as follows: 

G = yeast Gal4 DNA binding domain, amino acids 1-94 

F = human FKBP12, ammo acids 1-107 

R = FRB domain of human FRAP, amino acids 2025-21 13 

S = activation domain frcm the P 65 subunrt of human NF-kB . am.no ac.ds 361-550 
V = activation domain from Herpesvirus VP16 amino acids 410-494 

L = E coli lactose reoressor, amino acids 46-360 

MT = Mmimal Tetramenzat.on domain of E. coli lactose repressor, ammo acids 324-360 

For example, pCGNN-GF2 was made by insertion of the Gal4 DNA binding domain into 
pCGNN to generate pCGNN-G. followed by the sequential insertion of 2 FKBP domains. 
PCGNN-L was made inserting the Xba1/BamH1 digested PCR tragments of lactose 
repressor coding seauences (amino acids 46-360) into PCGNN vector. PCGNN- LS was 
made by inserting p65 activation domain (amino acids 361-550) into Spe1 and BamHI 
digested PCGNN-J_express,on plasmid. PCGNN-G AL4 CB was made by inserting Xba1 
and BamHI digested fragments of c-CBL sequences into Spe1 and BamHI digested 
PCGNN-GAL4 expression plasm.d. PCGNN-MA was made by inserting Xba1 and 
BamHI digested DNA fragments containing SH3 domain coding sequences into 
Xba1/BamH1 d.gested PCGNN. PCGNN-MAS and PCGNN-MAMTS were made by 
inserting the S (p65 activation oomain) and MTS (minima! tetramenzation ooma.n fused to 
P 65 activation domain, respectively into Spe1/BamH1 digested PCGNN-MA vector. 
5xGAL4-IL2-SEAP contains 5 GAL4 sites upstream of a minimal IL2 promoter driving 
expression of the SEAP gene (a gift of J. Morgenstem and S. Ho). The retroviral vector 
P LH-5xGal4-IL2-SEAP was constructed by cloning the 5xGAL4-lL^-SEAP fragment 
described above into the vector pLH (Rivera et al. 1996, Nature Medicine 2:1028-1032: 
Natesan et al. Nature 1997 Nov 27 390:6658 349-50), which also contains the hygromycin 
B resistance gene dnven by the Moloney murine leukemia Virus long terminal repeat. 



66 



„,,„ PCTTJS9817723 
WO 99/10510 



Example 2: Generation of stable cell lines: 

Tc aenerate cel!s containing the P LH-5xGAL4-IL2-SEAP reoorter stably integrated, 
h-lper-free retrovirus, generated as descnbed (Rivera et al. 1996; Natesan et al, 1 997), 
5 was used to infect HT1 080 cells. Hundreds of hygromycin B (300 mg/ml) res.stant clones 
were pooled (HT1060 B pool) and inaividuat clones screened by transient transfection 
with pCG-GS. The most responsive clone, HT1080B, was selected for further analysis. 

Example 3: Transient Transtections 

r HT1080 cells were grown at 370 C in MEM medium containing 10% fetal calf serum, non- 
essential ammo acids ano penic.ll.n-streptomycin. Twenty-four hours before transfection, 
approximately 2 x 105 cells were seeded in each well in a 12-well plate. Cells were 
i-ansfected using Lipofectamine as recommended (Gibco BRL). Cells in each well 
received the amounts plasmids ind.cated in the figure, with or without 400 ng of reporter 
•5 plasmid with the total amount of DNA being adjusted to 1 .25 ug with pUC19. For 

experiments shown in F.g. 5. 10 ng of plasmid expressing DNA binding doma.n fusions 
and increasing amounts ol plasmid expressing p65 activation doma.n fusions were 
included. After transfection for five hrs, the medium was removed and 1 ml of fresh medium 
added 1 8-24 hrs later. 1 00 ul medium was removed and assayed for SEAP activity using 
20 a Luminescence Spectrometer (Perk.n Elmer) at 350 nm exatation and 450 nm em,ss>on. 
Where indicated, 2-5 ul of medium was also assayed for hGH protein as recommended 
(Nichols Diagnostic). 

Example 4: Delivery of bundled activation domains to the GAL4 DNA binding 
25 domain 

The basic svstem used for regulated gene expression (Fig. 1 A)involves two fusion 
proteins one containino a DNA-b.nding domain (such as GAL4) fused to a single copy of 
FKBP12 and the other containing a transcnption activation domain (such as from the p65 
subunit of NF-kB) fused to the FRB domain of FRAP (see e.g., Rivera et al). In the 
30 presence of the natural-product rapamycn, which forms a high affinity complex with FKBP 
and FRB domains, the FRB-p65 tus.on protein is efficiently recruited to the GAL4-FKBP 
fus.on protein. This basic system results in the delivery of a maximum of one p65 
activation domain per DNA binding domain monomer (F.g. 1 A). In th.s system the numoer of 
act.vat.on domains delivered to the promoter can be increased by lus.ng mult.ple FKBP 
3* moieties to GAL4, allowing each DNA binding doma.n to recruit multiple FRB-p65 activation 
domain fusions (F.g. IB). Because the fusion protein containing the activation ooma.n is 
expressed separatelv in this system, it is possible to bundle act.vat.on doma.n fus.on 
protems and deliver them to FKBP moieti es linked to the GAL4 DNA b,nd,ng doma.n. For 
example, the addition of a tetramenzat.on domain present in the E. col. lactose repressor 
40 between the FRB and activation domains should generate a fusion protein "bundle 
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ccmprising of four activation domains and FRB domains, which in the presence of 
-oimerizer can be delivered to each FKBP moiety (F.g. IC). In the configuration depicted in 
Fig 1D rapamycin mediates the recruitment of a tetrameric complex ot bundled actuation 
aomam fusion proteins to each FKBP of a Gal4-4xFKBP fusion protein, permitting 
5 recruitment of up to sixteen p65 activation aomains to a single GALA monomer. Analogous 
,mprovements on allostery-based systems, also based on bundling, are shown ,n Figs 1 E 
- 1H. 

Example 5: Transcriptional activation is proportional to the number of activation 
ic domains bound to the promoter. 

To test how bundled activation domain fusion proteins funct.on in this system, we 
transfected HT1080 B cells with piasmics expressing various transcription factor fusion 
proteins and treated the cells with 10 nM rapamycin to deliver the activation domains to the 
promoter. We observed tnat when only one RS or RLS fusion prote.n is delivered to each 
15 GAL4 monomer (GF1* RS and GF1 +RLS), bundled activation domain fusion prote.ns 

,nduced the reporter gene strongly as compared to the unbundled activation domain fusion 
proteins. This finding suggests that bundled activation domain fusion proteins, because of 
their ability to deliver more activation domains to the promoter, function as h.ghly potent 
inducers of transcnption. Furtnerrnore, our studies using various combinations of DNA 
20 binding fusion prote.ns and actuation oomam fusion proteins revealed that the level of 

reponer gene expression is roughly linear with the number of activation domains tnat can 
be delivered to a single GAL4 monomer bound to its promoter (Fig. 2A). 

The RLS fusion protein is caoable of delivering four times more p65 activation 
domain to the promoter than its unbundled counterpart, RS. In theory, FRB fusion protein 
25 containing four tanaemly reiterated P 65 activation domain (RS4) should deliver same 

number of activation domains to the promoter as RLS and therefore should have similar 
transaction capacity. To examine whether RS4 can function in a manner similar to RLS 
in the rapamycin regulated gene expression system , we transfected expression plasm.ds 
encoding the DNA b.nding receptor. GF1 , together with RS4 or RLS fusion proteins into 
30 HT1 080 B cells and analyzed the expression of the integrated reporter gene by adding 1 0 
nM rapamycin to the medium. We found that rapamycin induced the reporter gene strongly 
in cells expressing the GFtand RLS but not the GFtand RS4 comb.nat.on of fusion 
croteins indicating tnat the re.terated p55 act.vation domains are weak inducers of 
transcnption in the cmenzer system (Fig_2B). In contrast, rapamycin was aole to induce 
« reporter gene expression in the presence of the GF3 and RS4 combination of fusion 

proteins, albiet at much lower levels than the GF1/RLS combination of prote.ns Without 
being limited to a particular tneory, GF3 fusion prote.ns should recruit three times more 
activation domains to the promoter than GF1 . The finding that RS4 fusion protein can 
induce transcriptional actr.at.on much more strongly when tethered to GF3 as compared to 
40 GF1 suggests that when the concentration of activation domain fusion protein ,s very low. 
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more activation domains can be recruited to the promoter by increasing the number of 
FKBP moieties fused to the GAL4 DNA binding domain. A western blot analysis of the 
intracellular levels of the transfected proteins revealed that the amount of RS4 in the ceil is 
below the level of detection, which may explain why it acts as a poor inducer of 
transcnption. These observations strongly suggest that the bundling strategy, unlike 
reiteration, generates highly potent activation domains that are less toxic to cells. 

One possible explanation for pan or all ot the robust induction of gene expression 
by RLS fusion proteins .s that the close proximity of four FRB moities in the RLS bundle 
produces an avidity effect. To test this, we devised a strategy as illustrated in Fig. 3A. In 
theory, co-expressing a limited amount of RLS in the presence of a large excess of LS 
fus.on protein shouid promote the formation of RLS oundles containing, at most, a single 
FRB comain To examine the consequences of reducing the number of FRB oomams in the 
RLS bundle on reporter gene expression, we co-transfected HT1080 B cells with relevant 
expression plasmids and analyzed the expression of the GALA responsive gene in the 
presence of 1 0 nM rapamycin in the medium. As previously observed (see Fig. 2A). 
raoamycin induced only low levels of reporter gene expression in cells expressing GF1 
and RS fusion proteins. However, reporter gene expression was very robust in cells 
expressing GF1 and RLS fusion proteins (Fig. 3B). To our surprise, in cells expressing 
GF1 a limited amount ot RLS and a large excess of LS fusion protein, rapamycin induced 
o reporter gene expression to even higher levels than those achieved by GF1 and RLS 
fusion proteins alone (Fig. 3B). This suggests that the strong stimulation of gene 
expression by RLS fusion proteins is not dependent on the presence of multiple FRB 
domains in the bundle. Inoeed. tne data shown here indicates that the presence of multiple 
FRB domains in RLS fus.on protein actually diminishes its capacity to activate gene 
,5 expression to the max.mum possible level. It is likely that rapamycin allows multiple FRB 
domains in the RLS to make contact with more than one GAL4-FKBP monomer bound to 
the promoter, effectively reducing the numoer of activation domains del.vered. However . 
RLS bundles with a single FRB comain can make contact with only a single GAL4-FKBP 
monomer and therefore can recruit greater number of activation domains to the promoter. 
30 leading to a slight increase in the target gene expression. 

To assess the consequences of reducing the number of activation domains in the 
RLS fusion protein, we expressed excess amounts of lactose repressor reg.on (L, amino 
acds 46-340) relative to RLS, togetner with the DNA binding protein GF1 and induced 
reporter oene expression by adomg 10 nM rapamycin to the medium. In this situation, tne 
3 c tetrameric bundles formed should contain a maximum of one activation domain and one 

FRB domain. Because reducing the number of FRB domains in the RLS bundle increased 
reporter gene expression, any mnibit.on of reporter gene expression in the presence of 
excess L reg.on relative to RLS can be attributed to a decline in the number ot activation 
doma.ns recruited to the promoter. The data in Fig. 3B show that an excess of a portion of 
40 the lactose repressor inhibits rapamyc.n-induced reporter gene expression in cells 
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expressing GF1 and RLS fusion proteins. A western blot analysis of the recombinant 
prote.ns in the transfected cells shows a good correlation between the amount of plasmid 
used in the transaction and the corresponds expression level of protein. Taken together, 
these observations strongly suggest that the RLS fusion proteins function as potent 
inducers of transcription primarily because of their ability to deliver sign.f.cantiy more 
activation aomains to the promoter. 

Example 6: Activation of transcription using a minimal tetramerization domain 
and synergizing activation domains 

The expenments cescrioed used the lactose repressor (minus its DNA b.nding 
domain) as the bundlmc comam in fus.no prote.ns also containing the FRB and activation 
domains. In addition to tne tetramerization domain, this portion of lactose repressor contains 
the lactose b.nding domain and the flanking linker regions. To determine whether the 
tetramerization domain of lactose repressor alone is sufficient for bundling fusion proteins, 
we made an expression olasm.d. RMTS, in which the lactose repressor coding seauences 
(ammo acids 46-360) in tne RLS fusion protein was replaced with a th,rty-s.x am.no ac.d 
region between am.no acids 324 and 360 containing the tetramerization domain and a 
pomon of upstream l.nker reg.on (MT). We have found that combination of P 65 and VP16 
activation domains wnen fused to GAL4 DNA binding domain synergistically induced GAL 
responsive genes. To exam.ne whether they behave similarly when bundled together 
using the minimal lactose repressor minimal tetramenzation domain, we generated two 
additional plasmids. RMTSV and RMTV in which the VP 16 activation doma.n (am.no ac.ds 
419-490) was fused to RMTS or RMT respectively. We then co-transfected plasm.ds 
expressing appropnate combinations of fusion proteins (Fig. 4) into HT1080 B cells 
carrying a stably .ntegrated GAL4 responsive reporter gene and treated the cells with 
rapamycin to stimulate target gene expression. We obsen/ed that in cells expressing 
GF4/RMTSV and GF4/RMTS combination of fusion proteins, rapamyc.n induced the 
reporter gene expression to roughly six and three fold higher than GF4/RS combination of 
fusion proteins. In cells expressing GF4/RMTV or GF4/RSV combinations of fus.on 
proteins rapamycin induced the reporter gene only marginally higher than the levels 
induced by GF4/RS fusion proteins (F.g. 4). Although the fold induction of reporter gene 
expression by GF4/RMTS and GF4/RMTSV is slightly lower than GF4/RLS and 
GF4/RLSV three and six fold compared to four and eight fold respectively (see figure 2A). 
strong stimulation of gene expression by the activation domain fusion proteins containing 
the lactose repressor minimal tetramenzation domain suggest that the minimal 
tetramerization aomain is sufficient to bundle fusion proteins. 
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Example 7: Bundling reduces the threshold number of activators required to 
induce peak levels oi gene expression: 

If the strong stimulation of gene expression .nauced by the bundled fus.cn prote.ns 
conta.n,ng p65 activation domains is s.mply due to their ability to deliver more actwat.on 
domains to the promoter, a lower level of fusion protein containing the act.vat.cn domain 
should be sufficient in the case of bundling, as compared to unbundled act.vat.on domains, 
to strongly stimulate reporter gene expression. In tne d.menzer system, the number ot 
reconstituted activators formed can be controlled either by ad,ust.ng the amount of 
activation domain fusion prote.ns or by varying the amount of rapamycn addled to the 
medium We have employed both cf these complementary approaches to address the 
question of whether bundling of act,vat,on doma.ns reduces the threshold amount of 
activators required for robust express.on of the reporter gene. In the f.rst approach, vary.ng 
amounts of bundled act.vat.on doma.ns. RMTS and RMTSV. or the, unbundled 
counterpart. RS. were expressed in HT1080 B cells together with a fixed amount of GF4, 
the DNA binding receptor (F,g. 5AV The activators were reconstituted by the addition of >0 
nM rapamycn to the medium. The level of recombinant proteins expressed in the 
transfected cells was determ.ned by western b.ot analysis (F.g. SB). « «he^«^ 
activation doma.ns expressed, rapamycn failed to induce transcnpt,on of the reporter gene 
in cells expressing the GF4 + RS combined of fusion proteins. However^ we observed 
robust activation of reporter gene expression in cells containing the GF4+RMTS or 
RMTSV combination of fusion prote.ns. When tne activation domain fusion prote.ns . were 
present at high levels, rapamycn .nducod reporter gene express.cn to approximately four- 
and two-fold higher levels in cells containing the GF4+RMTSV and GF4+RMTS 
combination of fus,on proteins, respectively, as compared to GF4+RS tus.on P^™- 
,ndeed, the level of reporter gene express.on induced by the lowest amounts of RMTSV 
exceeded the level stimulated by the highest amount cf RS fusion prote.ns .n the cell (F.g. 
5A) These observat.ons suggest that peak levels of reporter gene express.on can be 
achieved with fewer reconstituted activators conta,n,ng bundled activation domains than 

with their unbundled counterparts. 

in the second complementary approach, we transfected HTfOSO B cells with a fixed 
amount of the expression p.asm.ds used in figure 5B and induced the reconstitutior , c , the 
activators by adding vary.ng amounts of rapamycn to the medium. In the presence of the 
GF4 DNA binding receptor, both RMTSV and RMTS fus.on proteins induced the reporter 
gene expression rooustly at 1 nM rapamycn in the medium. At this concentration of 
rapamycn in the medium, the GF4+RS combination of fus.on prote.ns failed to induce the 
reporter gene significantly above background leve!s. In all cases, we observed peak 
levels of reporter gene exoress.cn in the presence o, 10 nM rapamycn in the med.urr j(Rg. 
5B) Collectively, the finding that relatively low numbers of activators conta.n.ng mutople 
bundled activation domains are suff icient to strong.y induce gene expression suggests that 
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the tnreshold amount of activators required tor peak levels of gene exoression can be 
significantly lowered by .ncreasmg the potency of activators. 

Example 8: Bundling activation domain fusion proteins in the two-hybrid system 

enhances its sensitivity: 

The finding that robust expression of target genes can be achieved in the presence 
of relatively lew reconstituted act.vators containing bundled, but not unbundled, activation 
domain fusion proteins has important implications in two-hybnd assays. Although the two- 
hybrid system is a highly sensitive assay to detect protein-prote.n ,nteract,ons .n vwo, a 
number of factors may curtail the interaction between two hybrid proteins expressed in the 
cell One frequently faced problem with the two hybnd system is that eukaryot.c cells, 
cecause of their highly conserved biochemical regulatory pathways, often exhibit poor 
tolerance to high leve.s of the hybnd proteins, particularly those contain.ng the potent VP1 6 
activation domain, resulting in the very poor expression of fusion proteins in these cells, or 
in some cases, cell death. Because the success of this assay is dependent on the two 
hybnd proteins finding each other, it is essential that one or both of the hybrid prolans, 
preferably the fusion protein containing the activation domain, is present at relafively high 
amounts to promote the interaction between the two hybnd proteins. 

To examine whether the use of bundled activation domain fusion proteins would 
allow detection of protein-protein interactions that were previously undetectable in 
mammalian two-hybnd assays, we chose to study the interacts between two proteins, 
namely, the proto-oncogene C-Cbl and the C-Src SH3. The proline-rich domains of he 
C-Cbl proto-oncogene have been shown to bind to the SH3 domains of a number of 
signing proteins both in in vrtro and in yeast two-hybrid assays. However, in mammalian 
two-hybrid expenments, the GAL4-CBL and Src SH3-VP16 hybrid proteins failed to 
induce the expression of a stably integrated reporter gene. To exam.ne whether 
expressing "bundled" Src SH3-activat,on domain fusion protein together with GAL4-CBL 
would stimulate the GALA responsive gene, we made appropnate plasmids for exoress.ng 
the fusion proteins shown schematically in Fig. 6A and B, and introduced relevant 
combinations of expression plasmids into HT1080 B cells by transient transaction. We 
observed that neither GCBL alone, nor GCBL in the presence SH3-VP16 or SH3-p65 
induced the reponer gene express,on to detectable levels. However, -n the presence of 
tne bundled fusion proteins . SH3-LVP16 or SH3-L P 65, GCBL induced the reporter gene 
very strongly. These results show that the use of bund.ed activation domain tus.on prote.n 
can significantly .mprove the sensitivity of the two-hybrid assay (F.g. 6C). To assess 
whether the unbundled activation domain fusion proteins fail to induce the reponer gene 
expression due to their low intracellular levels, we earned out western blot analysis of 
ivsates trom the transfected cells. A representee western blot shown .n F.g. 6C 
illustrates that the unbundled fusion proteins, SH3-VP1 6 and SH3-p6S^ ™Z™*L 
present a. h^her amounts than their bundled counterparts. SH3-LVP16 and SH3-Lp6= 
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Fig 6C) suggesting that the lack o( reporter gene activation is not linked to the overall 
intracellular levels of the activation domain fusion proteins. However, in a separate western 
blot probed with GAL4 antibody, we were unable to detect the presence of GaW-CBL. 
suggesting that this fus.on protein is toxic to cells. Thus, we conclude that when the DNA 
binding component (GCBL) is present in very low amounts in the cells, only tne bundled 
activation domain fus.on proteins are capable of delivering a sufficient number of act.vat.on 
domains to the promoter for transcriptional activate of the reporter gene to occur. Taken 
together, these data strongly suggest that bundling activation domain fus.on prote.ns. in 
mammalian two-hybrid assays, may greatly enhance the detection of interactions between 
two proteins when one or both of them is present at very low levels in the cell. 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation., numerous equivalents to the specific materials and methods 
described herein. Such equivalents are considered to be within the scope of this 



invention. 
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Claims: 

1 . A recomoinant nucleic ac.a encoding a fusion prote.n contain.ng a bundling doma.n and 
at least one additional domain that is heterologous thereto. 

2. The recombinant nucleic acd of claim 1 wherein the bundling doma.n is a dimenzat.on 
domain, tnmenzation aomain or tetramenzation domain. 

3 The recombinant nucleic acic o! claim 2 wherein the bundling domain is or is derived . 
from a lac repressor tetramenzaton oomain. a D 53 tetramenzation domain or a leucine 
zipper domain. 

4. The recombinant nucie.c ace of any of claims 1 -3 wherein the heterologous domain is 
a transcnption activation domain. 

5. The recombinant nucleic acd of any of claims 1 -3 wherein the heterologous domain is 
a transcription repression domain. 

6. The recombinant nude.c acd of any of claims 1 -3 wherein the heterologous domain is 
a DNA binding domain. 

7. The recombinant nucleic acd of any of claims 1 -3 wherein the heterologous doma.n is 
a ligand binding domain. 

8. The recombinant nucie.c acd of claim 4 wherein the heterologous domain is or is 
derived from a p65, VP 16 or AP domain. 

9 The recombinant nucleic acd of claim 5 wherein the heterologous domain is or is 
denved from a KRAB doma.n or a ssn-6.TUP-1 or Kruppel family suppressor doma.n. 

1 0. The recombinant nucleic acd of claim 6 wherein the heterologous domain is or is 
derived from a GAL4. lex A or composite DNA-binding domain. 

1 1 The recombinant nucleic acd of claim 7 where.n the heterologous doma.n is or is 
denved from an immunochihn, cydcohii.n. FRB, antibiotic resistance or hormone recepto- 
domain. 

1 2. The recombinant nucleic acd of claim 1 1 wherein the heterologous domain is or is 
derived from FKBP. tetR, progesterone receptor or ecdysone receptor. 
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1 , Tne recombinant nuole.c actd ot any of cla.ms 1 . 2. 3. 4, 6. 8 or 10 where.n the fusion 
Protein oompnses a bundling domain, at least one transcription activation dorr.a.n and at 
least one DNA binding domain. 

14 The recombinant nucieic acid of any of claims 1. 2. 3. 4. 7, 8, 11 or 12 where.n tne 
| U s.on protein comprises a bundling domain, at least one transition activation domain 
and at least one ligand binding domain. 

1 5 71* recombinant nucie.c acid c< any of cia.ms 1 . 2, 3. 5. 9. 7, 1 1 or 12 wherem the 
|,sior, protein comprises a bunding comain, at least one transcription repression demar 
and at least one ligand bmd.ng comain. 

1 6 7he recomb,nant nucle.c acid of any of c.a.ms 1 , 2. 3. 6. 7. 1 0. 1 1 or 1 2 wherem the 
fusion protein comprises a bund,ng oomain, at least one DNA bmd.ng comain and at least . 
one ligand binding domain. 

17 The recombinant nucleic aao of any of daims 1 . 2. 3. 4. 6. 7. 8. 10, 1 1 or 12 encoding 
a fusion protein containing a bundling domain, a ligand binding domain, a transcription 
activation domain and a DNA binding oomain. 

18 The recombinant nucle.c acid of claim 14 wherein the fus.on protein contains a lac 
repressor tetramenzation domain, at least one FRB oomain and at least one P 65 
transcription activation domain. 

n The recombinant nudeic acic of any of daims 8, 13. 14, 17 or 18 wherein the fus.on 
protein comprises at least one domain denved from a p55 transcription activation domain 
which contains one or more of the mutations of figure 7. 

20 A recombinant nude.c aad encoding a fusion protein containing at least one doma.n 
derived from a P 65 transcription actuation domain and at least one domain wh.cn ,s 
heterologous thereto, in v.-hich tne p55-denved doma.n contains one or more of tne 
mutations of figure 7. 

21 . Tne recombinant nucle.c acid of c.aim 20 where.n tne heterologous oomam is a l.gan: 
binding domain. 

22. The recombinant nucieic ac;0 ot caim 21 wherein tne ugand-binding ooma n is or ,s 
denved from an FKBP, cyclophilin or FRB domain. 
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23. The recomb.nant nucleic acid of claim 21 wherein the hgand-binding aomain is or is 
aenved from a tetR domain. 

24. The recombinant nucle,c ac.d of da.m 21 wherein the licand-binding domain ,s or is 
derived from a hormone receptor domain. 

25. The recombinant nucle.c acd of cia.m 24 wnerein the hormone receptor doma,r. is a 
steroid receptor domain. 

26. The recombinant nucle.c ac,a cf claim 20 wherein the heterologous aoma.n is a DNA 
binding domain. 

27. The recomb.nant nucie.c acid of claim 26 where.n the DNA binding domain domain is or 
is cenved from a GAL4. lex A or composite DNA-binding domain. 



28. 



A fusion protein encoded by the recombinant nucleic actd of any of claims 1-27. 



29 A nucleic acid composition compnsing 

(a) a first nucie.c acid encoding a fusion protein containing a bundhng domain, a 
l.gand binding domain and a transcription activation domain 

(b) a second nucleic acid encoding a fusion protein containing a Itgand binding 
domain and a DNA binding domain. 

30 A nucleic acid composition compnsing 

(a) a first nucleic acid encoding a fusion protein containing a bundling domain, a 
ligand binding domain and a DNA binding domain 

(b) a second nucleic acid encoding a fusion protein contain.ng a iigand binding 
domain and a transcription activation domain. 

31 A nucleic acid composition compnsing 

(a) a first nucleic acid encoding a fusion protein containing a bundling comam, a 
ligand binding domain and a transcnptjon activation domain 

(b) a second nucteic acid encoding a DNA binding domain. 

32. The nucleic ac:d composition of any of claims 29 - 31 which further compnses a target 
gene operatively linked to an expression control sequence. 

33. A nudeic acid composition compnsing 
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(a) a first nucleic acid encoding a fusion protein containing a bundling domain, a 
ligand binding domain, a transaction activation domain and a DNA binding 
domain 

(b) a second nucleic acid comprising a target gene operatively linked tc an 
expression control sequence. 

34 A nucleic acid comDOsition compnsing 

(a) a first nucleic acid encoding a fusion protein containing a bundling dcma.n.a 
transcnption activation domain and a DNA binding domain 

( b ) a second nucle.c acid comprising a target gene operatively linked to an 
expression control sequence. 

35 The nucle.c acia comoosmon of cia,m 32 or 33 which further compnses a nucleic ac-d 
encoding a fusion protein containing a bundling domain and a transcription activat.cn 
domain. 

36 The nucleic acid composition of claim 34 which further compnses a nucleic acid 
encoding a fusion protein containing a bundl.ng domain and a transcription act.vaUon 
domain. 

37 The nucleic aaa composition of cia.m 32 wnich further compnses a nucle.c acid 
encoding a fusion protein conta.ning a ligand binding domain, a bundling domain and a 
transcription activation domain. 

38. A vector comprising a nucleic acid of any of claims 1 -27. 

39. A vector comonsing a nucle.c acid composition of any of claims 29-37. 

40. The vector of claim 38 or 39 wherein the vector is a viral vector. 

41 The vector of claim 40 wherein tne vector is selected from the group consisting of 
adenoviral vectors, AAV vectors, retroviral vectors, hybrid adenovirus-AAV vectors. HSV 
vectors. 

42. The vector of claim AO or 41 which is further packaged into recombinant virus. 

43. A composition comprising 

(a) a first recomb.nant v^rus composing the nucleic acid composition of claim 29, 

30 or 31 
and 
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(b) a second recombinant virus compnsing a target gene construct comprising a 
target gene operatively linked to an expression control sequence. 

AA A composition compnsing 

(a) a first recombinant v,rus comprising the recombinant nucleic acid of claim 17 

and 

(b) a second recombinant virus compnsing a target gene construct comprising a 
target gene operatively linked to an expression control sequence. 

45 A composition compnsing 

(a) a first recombinant virus compnsing the recombinant nucleic acid of cia.m 13 

and 

(b) a second recomoinan: virus comprising a target gene construct comprising a 
target gene operatively linked to an expression control sequence. 

46 The composition of cla.m 43 or 44 wherein the second virus additionally comprises a 
nucleic acid encoding a fusion protein compnsing a bundling domain and a transcnption 
activation domain. 

47 The composition of claim 45 wherein the second virus additionally compnses a nucleic 
acid encoding a fusion protein compnsing a bundling domain and a transection act.vat.on 
domain. 

AS The composition of claim 43 wherein the second virus additionally compnses a nucleic 
acid encoding a fusion protein compns.ng a ligand binding domain, a bundling domain and a 
transcription activation domain. 

49 The composition of claim 43 or 44 wh.ch further compnses a third recombinant virus 
containing a nucleic acd encoding a fusion protein compris.ng a bundling domain and a 
transcription activation domain. 

50 The composition of claim 45 which further compnses a third recombinant virus 
containing a nucleic acid encoding a fus.cn protein composing a Dundlinc domain and a 
transcnption activat)on aomatn. 

51 The composition of claim 43 wmch further compnses a th.rd recombinant virus 
containing a nuc:e,c aad encoding a fusion protein compns-ng a iigand binding domain, a 
bundling domain and a transaction activation domain. 
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*2 The composition of any of claims 43-51 wherein the recombinant virus is selected 
from tne group consisting of adenovirus, AAV. retrov.rus, hybrid adenovirus-AAV, HSV. 

53. A method tor rendering cells capable of ligano-dependent transcnpt.on of a target gene 
oy introducing into the cell any of tne nucleic acid compositions of claims 29-33. 35 or 37 
under conditions permitting uptake by the cell of nucleic acids. 

54. A method for rendering cells capable of ligancMependent transcription of a target gene 
by introducing into the cell any of the compositions of claims 43. 44, 46, 47, 49 or 51 

55. The metnod of claim 53 or 54 wherein the compositions are .ntroduceo ex vivo. 
55. Tne method of claim 53 or 54 wnerem the compositions are introduced in vivo. 

57. A host cell containing a nucleic acic of any of claims 1-27. 

58. A host cell containing a nucleic acid composition of any of claims 29-33. 35 or 37 . 

59. A host cell containing a nucleic acid composition of claim 34 or 36. 

60. A host cell containing a composition of any of claims 43, 44, 46, 47, 49 or 51 

61 . A host cell containing a composition of any of claims 45, 47. or 50. 

62. A host cell prepared by the method of any of daims 53-56. 

63 A method for reculatmc express.on of a target gene by adding a cell permean, Ugand 
to the host cell of any of claims 58, 60 or 62. wherein the cell permeant Ugand bmas to the 
ligand binding domains of the fusion proteins and activates gene expression. 

64. The method of claim 63 wherein the host cell is in a whole organism. 

65. The method of claim 64 wherein tne organism is a mammal. 

66. The method cf claim 65 wherein the cells are cf pnmate ongm and the mamma, is a 
phmate. 

67. The method of claim 66 wherein the pnmate is a human. 
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68 A DNA vector containing a recombinant DNA sequence compris.ng a first portion 
encoding a fusion protein containing a bundling domain and an add,t,onal oomain that ,s 
heterologous thereto and a second portion comprising a clon.ng site tor the insertion ot a 
DNA sequence of interest. 

69. A cell containing reccmbinant nucleic acids encoding 

(a) a first fusion prote.n compnsing a bundling domain, a transcription activation 
oomam and one member of a peptide binding pair, 

(b) a second fus.cn prote.n compris.ng a DNA-b,no.ng doma,n and the other 
member of the peptice binding pair, 

where.n the peptide b.nd.ng pair comprises (i) a peptioe ligand and (« a pept.de 
binding domain capable of binding to the peptide ligand, and 

wherein the cell further contains a reporter gene wh.ch is linked to an express.on 
control sequence which permns reporter gene expression upon association of the two 
fusion proteins. 

70 A genetically engineered host cell which comprises 

(a) a reporter oene linked to a regutatable expression control element, 

(b) a first recombinant nucleic ac,d encoding a fusion protein compns.ng a DNA 
binding doma.n linked to a protein domain of interest and 

(c) a second recomomant nucleic aad oompnsing a clon.ng site linked to a 
nucleic acid sequence encoding a fusion protein containing a bunding 
domain and a transcription activation domain 

wherein association of the fusion proteins activates expression of the reporter gene. 

71 A genetically engineered host cell which compnses 

(a) a reporter gene linked to a regulatable express.on control element. 

(b) a first recomb.nant nucleic acid encoding a fusion prote.n compns.ng a DNA 
binding doma.n linked to a protein domain of interest and 

(c) a second recombinant nucleic acid comprising a member of a test library 
linked to a nucleic aad sequence encoding a fusion prote.n containing a 
bundling domain and a transcription activation domain 

wnerein association of the fusion proteins activates expression of tne reporter gene. 

72 A geneticallv engineered host cell wh.ch comprises 

(a) a reporter gene linked to a regulatable expression control element, 

(b) a first recomb-nant nucfe.c acid encod.ng a fusion protein compns.ng a 
transection activation domain linked to a prote.n domain of interest and 
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(c) a second recombinant nudeic acid compnsing a cloning site linked to a 
nucleic acid sequence encoding a fusion protein containing a bundling 
domain and a DNA binding domain 
wherein association of the fusion proteins activates express.on of the reporter gene. 

73. A genetically engineered host cell which comprises 

(a) a reporter gene linked to a regulatable expression control element. 

(b) a first recombinant nucleic acid encoding a fusion protein comprising a 
transcription activation domain linked to a protein domain of interest and 

(c) a second recombinant nucleic acid comprising a member of a test Horary 
linked tc a nucleic acid sequence encoding a fusion protein containing a 
bundling domain ana a DNA binding domain 

wherein assoc.at.on of the tus.on proteins activates express.on of the reporter gene. 

74. A method for identifying a moiety capable of binding to a protein or protein domain of 
interest which comprises the steps: 

(a) contacting genetically engineered cells of claims 69-73 with members of a 
combinatorial library under suitable conditions permitting gene express.on, 

(b) observing the presence and/or amount of expression of the reporter gene. 

and 

(c) correlating the presence and'or amount of reporter gene expression with 
contact of cells witn one or more .nd.vidual members of the comb.natonal 
library. 
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